Welcome!



Welcome to this course on Quantum Mechanics. It is aimed at students who have not 
had any exposure to the subject before. Hopefully you enjoy this learning material. Extra 
material can be found on the course website: 



http://www.bramm.be/qmcourse 



Background material 



The course assumes familiarity with basic calculus (differentiation and integration)
and with complex numbers. The reader should also be familiar with the notion of vector spaces
and matrices. On the physics side, it is probably useful to have some knowledge of classical
mechanics and electromagnetism. If you know these subjects, you should definitely be
able to master this course.

Style 

This course is written in a rather loose style. The emphasis is mainly on physical understanding
and not on mathematical rigor. Also, all chapters (with the exception of the first
one) have a fixed outline:

• First, we introduce some necessary tools. Typically some math or a short
physics review.

• After that, a story is told. This is the theory/history part of the chapter.

• Finally, we combine the two previous ingredients (tools and story) to get to the
physics of the chapter. Typically this involves a computation and some applications.

Good luck! 

Bram Gaasbeek, July 23, 2010

Part I



The Laws of Quantum Mechanics 



Chapter 1 

Intro: what is Quantum 
Mechanics? 

In this chapter... 

In this chapter you will learn the basic 'quantum concepts'. You may find them a bit
different from things you have learned before, but don't be afraid, they are far from incomprehensible.
After reading the first few chapters, you may even see quantum mechanics
as quite a natural description of the world on its smallest scale.




1.1 Unequal starts: reading guide 

As with any course, the biggest problem is the unequal start. Some students have learned 
more than others, some less. As a consequence, every course will always start off too fast 
for some and too slow for others. 

To solve this problem, there are several appendices to this chapter. They contain
things you should know before reading this course. However, you might know quite a
bit of it already. The descriptions below should help you decide which appendices you
should read, and (more importantly) which you can skip.

• Appendix 1: Complex numbers recap. 

Only needed if your knowledge of complex numbers is poor. Review of complex
conjugate, norm, inverse, $e^{i\theta}$, etcetera.

• Appendix 2: Moving waves

Review of traveling waves like $y(x,t) = \sin(kx - \omega t)$ or $y(x,t) = e^{i(kx - \omega t)}$. Needed
if you are not familiar with the concepts 'wave amplitude', 'wave number', 'wave
speed' or 'complex wave'.

• Appendix 3: Maxwell's equations. 

Reviews how electromagnetism can be summarized in the four laws of Maxwell. If
you don't know these laws by heart but understand them, you certainly do not need
to read this appendix.

• Appendix 4: Light is an electromagnetic wave. 

Needed if you never learned about this fact, or if you have never seen how the 
Maxwell equations imply the existence of waves. 

• Appendix 5: What is interference? 

Needed if you do not know the calculation of interference of (light)waves falling 
through two narrow slits. 

If you know all of this, congratulations; forget about those appendices and read on. If 
there are some things you don't know yet, take your time to read them carefully - they
will help make sense of what follows. Again: restrict your attention to what you don't
know - ignore the rest. After catching up, return to this point. 

1.2 Quantum Mechanics, what's up? 

As a warmup, some quick questions and answers about Quantum Mechanics...

Why would I learn Quantum mechanics?

Let's think a bit about the physics you have learned so far. Throughout the past years, 
you got to know how bodies experience and exert forces, how gases behave, how electromagnetic
forces act and all that. On top of that, you learned that molecules constitute all
the matter we see in daily life. These molecules are arranged in gases, solids and liquids,
mixed up in one big soup - the world around us. Better even, those molecules are made 
of atoms, and each single atom is a cloud of electrons flying around a nucleus of protons 
and neutrons. That's pretty impressive. Indeed, this insight explains almost the entire 
reality around us in terms of a very small set of particles: electrons, protons and neutrons. 
However impressive and unifying this fact may be, it does leave us with some very real 
questions. Questions that you -most likely- have not gotten an answer to yet. Why do 
those particles stick together in atoms? Why do they sit together very very close, but not 
on top of each other? And what are these particles really? Just points? Or do they have 
some size? How do they behave? And so on. Precisely these questions are answered by 
quantum mechanics. 

What will I learn in this course?
So, in this course, you will learn what those particles are, really. This means we will 
describe reality on the smallest scale. This description will be different from the physics 
you have learned so far. That does not mean one or the other is wrong. They are just 
descriptions of different physical situations. If you want to describe the dynamics of 
individual particles (or just a few of them) then you have to use quantum mechanics. For 
gases, solids, and all that, we don't use this description: it would be both impractical, 
and impossible. Indeed: how could we describe a gas by mathematically keeping track of 
all particles of such a system at the same time? We can't, that's why -for larger systems- 
we use gas laws, heat laws, Newton's laws, and so on. Those laws are very accurate for 
macroscopic systems, but one should always bear in mind that they actually follow from 
the dynamics and physical laws that govern the microscopic world - that of particles. That 
is what makes Quantum Mechanics (short: 'QM') - the fundamental theory of microscopic 
systems - so interesting. 

Is QM difficult? 
QM may be more difficult than the physics you have learned up to this point, but definitely 
not 'impossible to understand' or 'comprehensible to only few'. You may wonder why it 
is more difficult. Why do we understand -for example- gas laws more easily than the
quantum mechanical laws that describe fundamental particles? Actually, there is a simple 
answer to that question. Gas laws deal with systems of our scale, of our daily life. We 
have quite some feel for it, even before studying physics. You were not really shocked when 
learning that the pressure of a gas rises when you compress it, right? This is just because 
you have experienced this law many times (for example, when you inflate a bike tire) without really
realizing it. However, none of us 'feel' or 'see' fundamental particles in ordinary daily life.
If we did, the laws of quantum mechanics would feel very natural. You would just say 'ah
yes, that's indeed what it looks like' or 'ah, yes, a particle indeed moves like that' - in the
same way some people react to gas laws. Just keep in mind that our gas laws (or any
other macroscopic physics) would look equally strange and funny for -say- some exotic 
submicroscopic life form only used to the laws of QM. 

How sure are we about QM? 
Or even better: how did we ever discover the laws of quantum mechanics? Like any other
branch of science: by doing experiments and thinking about the results. You may know
that the basic laws of quantum mechanics were discovered about a hundred years ago.
Prehistory, that is, in terms of technology and science. Since their discovery, the laws of
quantum mechanics have been confirmed by numerous experiments, often with a higher
precision than anywhere else. So, it is no exaggeration to state that quantum mechanics
is among the best-tested theories of physics. On top of that, the number of inventions directly
using quantum mechanics is significantly bigger than most people think, and very likely 
to keep on growing in the very near future. In conclusion, despite its beauty and exotic 
character, quantum mechanics is not at all 'hypothetical' or 'merely philosophical'. It is 
an accurate, tested and elegant description of the smallest and most fundamental part of 
the reality around us, that of elementary particles. 

1.3 How it all began... two slits 

Quantum mechanics was born together with a series of remarkable experiments. One of 
them was the so-called two-slit experiment¹. The first version of this experiment was
actually carried out by Thomas Young in 1803. Some people back then were arguing
about the true nature of light. In our daily life, we see that light travels in straight lines
- you can't look around a street corner, right? This suggests light might be made up of
tiny particles or 'bullets' flying through space. However, some people -like Mr. Young-
were convinced light has some wave aspect to it. Indeed, from your course on
electromagnetism², you know that light is a wave in the electric and magnetic fields. So to
describe it, you need to give the distortion in the E and B fields, depending on space and
time. So you need to give $E(x,t)$ and $B(x,t)$ to describe a traveling piece of light. This
contrasts with a particle, which can be described by only its position depending on time:
$x(t)$. Anyhow, Mr. Young did not know all the fancy formulas of Maxwell, so how did he
find out light is a traveling wave, not a traveling particle?

So, how did he find out? The two-slit experiment.

He set up the following experiment: take a light source, a thin wall and a projection screen,
as in Figure 1.1. If you cut -for example- the shape of a cat away from the wall, and shine
on it, you see a bright shape on the projection screen, in the shape of a cat. Now, what
happens if instead two very narrow slits are cut, a small distance apart?
Naively, you expect two very narrow bright stripes, a small distance apart.
Well, strangely enough, that's not what you see. Instead, you see an alternating series of
bright and dark bands. Those bands are precisely what you see when waves create an
interference pattern. So Young could do nothing but conclude that indeed, light must be
a wave, even though he didn't know what kind of wave. Nice, huh?



¹ which you either learned about before, or have read about in Appendix 5

² or else in Appendix 4

But what about that cat-shaped hole then? Well, for larger holes, the interference
pattern becomes much more complicated. It is still there, really, but you barely see it. 
Moreover, it turns out that for larger holes, the spot on the screen gets the same form 
as the hole. From this, it looks like light just consists of 'bullets' passing through, even 
though that's not really true. Got it? 



[Figure: three panels, each showing a light source; panel labels: 'what you EXPECT' and 'what you OBSERVE'.]

Figure 1.1: If you shine a light source on a hole with the shape of a cat, light will
impact along a cat-shaped region (left). Logically, you would guess that two very narrow
slits lead to two very narrow bands of light impacts (middle). Strikingly enough, what you
observe is a whole series of bands (right). The conclusion is inevitable: light must be a
wave.



The surprise 

Anyhow, nowadays we understand Young's experiment very well, since now we know
precisely what kind of wave light is: an electromagnetic wave. But here comes the surprise.
Let's do the two-slit experiment again, but now for electrons. More precisely, tell your
engineering friend to build a nice electron source: an apparatus shooting electrons. He'll
readily do that for you. Of course, as we can't see flying electrons with our eyes, we have
to replace the projection screen by a set of detectors, which can measure the electrons
arriving. So the setup is exactly that of Figure 1.1, but with an electron source and a
detecting screen instead of a light source and a projection screen. When sending electrons
through the cat-shaped hole, the detectors will tell you that a cat-shaped region is under
fire. Ah, ok, that's what you expect. Now do as Young did for light: cut out two narrow
slits. What do the detectors tell you now? Let's think about it. Electrons really are
particles (so imagine flying 'bullets') so two slits should result in just two bands where
electrons are detected. However, that is not what happens. Precisely like for light, you
see an alternating series of bands. On one band, there are many detections, then on the next
there are few to none, and so on. Even if you lower the intensity of the source (so that there
is more time between the electrons, and you are sure they fly at the slits one by one, not
feeling the previous or next one) the result is the same. Bands, not just two stripes.

Interpretation 

There is only one conclusion we can draw. Each electron has to be a wave. Well, more
precisely, it is just a small piece of wave: a wavepacket. To a brutal and large observer like
us, that small packet looks like a point particle in most experiments. However, it does have 
a non-zero size. It is a very small blob. And when each such wave packet goes through 
the two slits, it interferes with itself. (The part of the wave packet that went through one 
slit interferes with the part that went through the other slit.) So the wavelike nature of 
the electron is only made visible because of the specific setup of the two-slit experiment. 
In most other experiments, we don't notice, and the electron just looks like a point, even 
if, in fact, it is spread out over a small region. Now, if you repeat the experiment for 
neutrons and protons, you find precisely the same result. So they are all wave packets. 
People like to call this the wave aspect of particles. 

1.4 Another experiment: the photoelectric effect 

Initially, the two-slit experiment completely stunned physicists. But the origin of QM lies
in several experiments and puzzles that (roughly) arose around the beginning of the 20th
century. One of them was the photoelectric effect. To see this effect, do the following.
Take two pieces of metal. Take a direct current source, and connect it to the two pieces of
metal, as shown in Figure 1.2. Since the metal pieces are not connected, no current can
flow through the circuit. That is indeed what you see experimentally (your ammeter will
indicate the current is zero). Well, wait: now do the following. Take a light source, of
which you can adjust the color (the frequency). Since you can set it to shine at only one
color, you call such a source monochromatic, which is Greek for one-color. Start with
very red (low frequency) light, and shine on the negatively charged piece of metal (see
figure). You still don't see anything special. Now if you increase the frequency of the light
(go from red to yellow to green, towards blue light) there is suddenly some frequency/color
at which you detect a current running. What? The only explanation is that the light strips
off electrons from the metal; those fly through the air towards the (attracting) positive
pole, making you detect a current.

[Figure: circuit diagram with labels 'light', 'source' and 'ammeter'.]



Figure 1.2: A setup showing the photoelectric effect. A source is connected to two pieces 
of metal. They get charged (electrons are displaced) but no net current flows through the 
circuit. When light shines on the negatively charged metal piece, things change. At least: 
if the light has a high enough frequency, it strips electrons out of the metal, so they can 
cross the gap, and reach the other side. This will cause a current to flow, as detected by 
the ammeter. On the other hand, if the frequency is too low, nothing happens, not even 
when one drastically increases the intensity of the light - the current stays exactly zero. 



Now scientists partly understood this experiment. They knew light (or EM radiation 
in general) is a form of energy, so it is not so strange that it can strip off electrons from 
a metal. The strange thing, however, is that it only starts doing so beyond a certain 
frequency (which depends only on the type of metal you are using). If the frequency 
is too low, there is zero current, regardless of the intensity! So if you are below that 
critical frequency, no matter how much light you shine on the metal, you don't detect a 
current. Once above the critical frequency, the current is
proportional to the applied intensity, so the number of electrons ripped off grows with the
amount of radiation you shine on the metal. So what is going on here?



Understanding the photoelectric effect 

This peculiar behavior remained unexplained for quite some while. Then, Albert Einstein 
proposed a simple explanation. Imagine that a ray of light consists of very many separate 
pieces. Let's call these fundamental bits of light photons. If you shine on the metal, all 
these photons bump into the atomic grid of the metal, one by one. Now on top of that, 
assume each photon has an energy proportional to its frequency. So 



$$E = h\nu \tag{1.1}$$



where $E$ is the energy of the photon, and $\nu$ is its frequency (color). The factor of proportionality
between the two, $h$, is called Planck's constant. Ok, suppose this is true.
Einstein then reasoned further as follows. Say there is some energy $\Phi$ needed to strip off
one electron. Then if the individual photons have an energy $E$ that is smaller than $\Phi$, a
single photon cannot strip off one electron. And as they arrive one by one (however fast
that may be) each of them has insufficient energy to kick an electron out. So nothing
happens. Only if the individual photons have an energy $E$ greater than $\Phi$, each of them
can strip off one electron. In this case, the result is obvious: the more photons you send,
the more electrons are released by the metal. Said differently: a higher intensity of light 
means more photons per second, so more electrons released, and a bigger current detected. 
A popular comparison is the 'ball and the fence'-story. Imagine you want to kick a ball 
over a fence. If you kick too softly, it will not go over. Even if you repeat this, the result
will be rather poor: the ball will just bounce back over and over again. However, if you 
kick hard enough, it will fly over the fence at once. 

As you see: the explanation of the photoelectric effect is not particularly difficult. To
understand it, you don't need to be an Einstein at all. Note that the above provides more
than just an explanation. It also gives a prediction. Since an electron needs an energy $\Phi$
to be kicked out (this energy is actually called the work function), and since the photon
delivers an energy $h\nu$, the kinetic energy $K$ of the electron after being kicked loose is given
by:

$$K = h\nu - \Phi \tag{1.2}$$

So the explanation of Einstein predicts a simple linear relation between the kinetic energy 
of the released electrons and the frequency of the light falling on the metal. With a 
slightly more careful setup one can measure the kinetic energy of the released electrons 
(by letting them cross a potential difference). And indeed, their kinetic energy precisely 
obeys the above relation. This result (and many other experiments) confirms that the
photon-explanation has to be correct.
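
The prediction above is easy to try out numerically. The following is a minimal sketch, not part of the original course: the function names and the work function of 2.0 eV are assumptions chosen for illustration, and Planck's constant is taken in units of eV·s.

```python
# Planck's constant in eV·s.
H_EV_S = 4.136e-15

def threshold_frequency(work_function_ev):
    """Lowest frequency (Hz) at which a single photon can free an electron."""
    return work_function_ev / H_EV_S

def kinetic_energy_ev(frequency_hz, work_function_ev):
    """Kinetic energy K = h*nu - Phi in eV, or None below the threshold."""
    k = H_EV_S * frequency_hz - work_function_ev
    return k if k > 0 else None

# Hypothetical metal with a work function of 2.0 eV (illustrative value only).
PHI = 2.0
print(threshold_frequency(PHI))        # about 4.8e14 Hz
print(kinetic_energy_ev(4.0e14, PHI))  # None: below threshold, whatever the intensity
print(kinetic_energy_ev(1.0e15, PHI))  # about 2.1 eV
```

Note that the intensity of the light does not appear anywhere in these functions: it only sets how many electrons are released per second, not whether they are released or how much energy each one gets.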

Interpretation 

So what does the photoelectric effect tell us? Light must be made up of many individual 
particles, each with its own energy. They are really separate objects. How can we reconcile 
this with the knowledge that light is a wave? Each photon has to be a little piece of the 
electromagnetic wave. If you put many of these 'elementary pieces of wave' together, it just 
looks like one big wave. So the particle-like behavior of light is only made visible because of 
the specific setup of the photoelectric effect. People like to call this the particle aspect 
of light. 

Putting the pieces together 

The experiments we have just seen force us to think very differently about particles and 
radiation. Before the advent of quantum mechanics, people thought there was a very clear 
difference between radiation and particles. They thought particles didn't have any size 
and should be described by their position as a function of time. They thought radiation
is a disturbance of the background, and should be described by an amplitude (of the
electric and magnetic fields) as a function of place and time. Quantum mechanics teaches
us that this is not correct. Instead, everything we see around us is built of wave packets.
If you see one such packet, as it tends to be very small, it may look like just a point of 
zero size. For radiation, if you see many of these wave packets at once, it looks like just 
one big wave, without obvious constituents - even if they really are still there. The idea 
that everything is built of wavepackets is really the core idea of quantum mechanics. We 
can actually summarize this entire chapter by one single drawing: 



[Figure: three sketches labeled 'wave', 'wavepacket' and 'particle'.]



This picture will be made more quantitative in the next section. 



1.5 The de Broglie relations 

From the photoelectric effect, we know that photons have an energy proportional to the 
frequency. This is a quantitative expression of the particle aspect of light. There is also a 
formula expressing the wave aspect of particles. Here is a way to obtain it experimentally. 
You know (or have read in Appendix 5) that a two-slit interference experiment yields 
bright bands (a lot of detection) at positions x on the screen satisfying 



$$x = \frac{n \lambda L}{d} \tag{1.3}$$

where $n$ is an integer, $\lambda$ is the wavelength, $d$ is the distance between the slits and $L$ is
the distance from the slits to the screen. So from a double-slit experiment, you can deduce the wavelength of the waves from
the positions $x$ of constructive interference. (Given that you also know $L$ and $d$.) So if
you do the double-slit experiment with particles, you can measure the wavelength of these
tiny wave packets. It turns out that the following holds:

$$\lambda = \frac{h}{p} \tag{1.4}$$

where p is the momentum of the particles falling through the slits (their mass times their 
speed) and h is again the Planck constant. In conclusion: the photoelectric effect and the 
double slit experiment show that 

$$\lambda = \frac{h}{p} \quad \text{and} \quad f = \frac{E}{h} \tag{1.5}$$

These equations are called the de Broglie relations³. The first relation gives the wavelength
associated to the wave packet of a particle. The second relation gives the energy
of a photon in relation to its frequency. But actually, both relations hold for photons and
for particles. So you can read the de Broglie relations in two ways. First, for a particle
with energy $E$ and momentum $p$, they give the wavelength and the frequency of the wave
packet of the particle. Second, for a photon with frequency $f$ and wavelength $\lambda$, they
give the energy and momentum of the photon. So a photon does not only have a certain
amount of energy, it also has a tiny bit of momentum. Again, this puts particles and
waves on a very symmetric footing.
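
To get a feeling for the sizes involved, the two readings of the relations can be tried out numerically. This is a small sketch, not part of the original course: the function names, the electron speed of 1.0e6 m/s and the photon wavelength of 500 nm are assumptions chosen for illustration.

```python
H = 6.626e-34           # Planck's constant in J·s
M_ELECTRON = 9.11e-31   # electron mass in kg

def de_broglie_wavelength(mass_kg, speed_m_s):
    """Wavelength lambda = h / p of a particle with momentum p = m * v."""
    return H / (mass_kg * speed_m_s)

def photon_momentum(wavelength_m):
    """Momentum p = h / lambda carried by a photon of the given wavelength."""
    return H / wavelength_m

# Hypothetical electron moving at 1.0e6 m/s (slow enough to use p = m * v).
print(de_broglie_wavelength(M_ELECTRON, 1.0e6))   # about 7.3e-10 m

# Hypothetical photon with a wavelength of 500 nm (green light).
print(photon_momentum(500e-9))                    # about 1.3e-27 kg·m/s
```

Both numbers are extremely small on a human scale, which fits with the remark that quantum behavior only shows up in rather specific experiments.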

We can also rewrite the above two equations a bit. Recall that the wavelength of a
piece of wave is related to the wavenumber $k$ by the relation $\lambda = \frac{2\pi}{k}$. Also, recall that the
angular frequency $\omega$ of a wave is defined as $\omega = 2\pi f$. Then the above two equations
can be written as

$$p = \hbar k \quad \text{and} \quad E = \hbar\omega \tag{1.6}$$

where $\hbar = \frac{h}{2\pi}$ is called the reduced Planck constant. The experimental values of $h$ and
$\hbar$ are⁴

$$h = 6.626 \cdot 10^{-34}\,\mathrm{J\,s} = 4.136 \cdot 10^{-15}\,\mathrm{eV\,s} \tag{1.7}$$

$$\hbar = \frac{h}{2\pi} = 1.055 \cdot 10^{-34}\,\mathrm{J\,s} = 6.582 \cdot 10^{-16}\,\mathrm{eV\,s} \tag{1.8}$$

This constant is not just small, it's tiny - that's one of the reasons why the quantum- 
behavior of particles is only manifest in very precise and specific experiments. 
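
For completeness, here is the rewriting step spelled out. It only uses the wavenumber $k = 2\pi/\lambda$, the angular frequency $\omega = 2\pi f$ and the reduced Planck constant $\hbar = h/2\pi$:

```latex
\begin{align*}
p &= \frac{h}{\lambda} = \frac{h}{2\pi/k} = \frac{h}{2\pi}\,k = \hbar k, \\
E &= h f = h\,\frac{\omega}{2\pi} = \frac{h}{2\pi}\,\omega = \hbar\omega .
\end{align*}
```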



³ The pronunciation of 'de Broglie' is somewhere in between 'duh Broy' and 'duh Brey'.

⁴ To see the conversion between units, recall that an electron volt (eV) is the energy you get from letting
an electron cross a potential difference of one volt, so $1\,\mathrm{eV} = q_e \cdot (1\,\mathrm{V}) = (1.6 \cdot 10^{-19}\,\mathrm{C})(1\,\mathrm{V}) = 1.6 \cdot 10^{-19}\,\mathrm{J}$.

1.6 Conclusion

The main conclusion of the chapter is the fact that the fundamental building block of 
nature is a wavepacket (a lump of wave). This is quite nice, since it rids us of the classical 
idea that nature consists of two separate building blocks (particles and waves). Whether
we are talking about a photon, an electron, a proton, ... all of these are wave packets. During
and since the discovery of quantum mechanics, this fact has been confirmed again and again
in experiments. Most textbooks mention not only the two experiments above, but several 
others. Many of those were equally important in the development of quantum mechanics. 
We have omitted those here, as they all lead to the same conclusion. If you are interested, 
go ahead and look them up. If you got the point though, maybe you don't really need to. 

Exercises

1. Explain to yourself (or better: to someone else) what the conclusion of this chapter 
was, and how it logically follows from the discussed experiments. 

2. What is the momentum carried by a red photon? (Red light has a frequency
$f = 450 \cdot 10^{12}$ Hz.) How many such photons do you need to obtain a momentum of
1 kg·m/s (=a decently served tennis ball)?

3. For every wave, $\lambda f = v$ with $\lambda$ the wavelength, $f$ the frequency and $v$ the wave
speed. Using that the speed of light is $c = 3 \cdot 10^{8}$ m/s, what is the wavelength of a
red photon?

4. You are doing the double-slit experiment with electrons. The distance between the
slits is 1 cm, and the impact screen is at 1 meter from the slits. What speed should
the electrons have in order to obtain an interference pattern with the bright bands
separated by 0.1 m? (The mass of an electron is $m_e = 9.11 \cdot 10^{-31}$ kg.)

5. The work function $\Phi$ of silver is about 4.6 eV. What frequency do photons need to
have in order to strip off electrons from a chunk of silver? Explain what happens at
lower frequencies. What happens at higher frequencies? What is the kinetic energy
of an electron stripped off by a photon of frequency $2 \cdot 10^{15}$ Hz?

Appendix 1: Complex numbers recap

As you probably know, a complex number is a number of the form $z = a + bi$ where $a$ and $b$
are real numbers, and $i$ is the imaginary unit (sometimes denoted $I$ or $j$) which satisfies
$i^2 = -1$. In such a decomposition, $a$ is called the real part of $z$, and $b$ is the imaginary
part:

$$a = \operatorname{Re} z \qquad b = \operatorname{Im} z \tag{1.9}$$

The sum and product of two complex numbers $a + bi$ and $c + di$ are given by:

$$(a + bi) + (c + di) = (a + c) + (b + d)i \tag{1.10}$$

$$(a + bi) \cdot (c + di) = ac + adi + bci + bdi^2 = (ac - bd) + (ad + bc)i \tag{1.11}$$

The complex conjugate of a complex number $z = a + bi$ is given by $\bar{z} = a - bi$. With
this, the real and imaginary part of a complex number can be written as

$$\operatorname{Re} z = \frac{z + \bar{z}}{2} \qquad \operatorname{Im} z = \frac{z - \bar{z}}{2i} \tag{1.12}$$

The norm of a complex number is defined by $|z| = \sqrt{a^2 + b^2}$. Also, each complex
number can be written in the form

$$z = |z| e^{i\theta} \quad \text{with} \quad e^{i\theta} = \cos\theta + i\sin\theta \tag{1.13}$$

The angle $\theta$ is called the argument of $z$.

$$\theta = \arg z \tag{1.14}$$

All these quantities can be shown very graphically, as shown
in the figure on the side.

For computations, there are some important properties
that will show up again and again throughout this course.
First, the inverse $z^{-1}$ of a complex number (defined by requiring
$z^{-1}z = 1$) is given by

$$z^{-1} = \frac{\bar{z}}{|z|^2} \tag{1.15}$$

Also, the conjugate of the sum and the product of two complex
numbers $z_1$ and $z_2$ are given by:

$$\overline{z_1 + z_2} = \bar{z}_1 + \bar{z}_2 \qquad \overline{z_1 z_2} = \bar{z}_1 \bar{z}_2. \tag{1.16}$$

Similarly, the norm of the product of two complex numbers is given by

$$|z_1 z_2| = |z_1||z_2|. \tag{1.17}$$

However, for the norm of the sum, one needs to be careful, since in general $|z_1 + z_2| \neq |z_1| + |z_2|$. If
you have forgotten some of these things, you could do the following short exercises:

• Show that the norm of $z = |z|e^{i\theta}$ really is $|z|$. (Write out real and imaginary parts
explicitly, sum the squares.)

• Check that the expression for the inverse $z^{-1}$ of a complex number indeed implies
$zz^{-1} = 1$.

• Show the properties (1.16) and (1.17) listed above.

• Use these properties and (1.15) to show that $(z_1 z_2)^{-1} = z_1^{-1} z_2^{-1}$.

• Find an example of two complex numbers, such that $|z_1 + z_2| \neq |z_1| + |z_2|$. (You
can even find examples in the real numbers, which are just a subset of the complex
numbers.)
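
If you like to experiment, the properties of this appendix can be checked numerically for specific numbers. The sketch below uses Python's built-in complex type and the standard `cmath` module; the two numbers `z1` and `z2` are arbitrary choices. A numerical check for one example is of course not a proof, so it does not replace the exercises above.

```python
import cmath

z1 = complex(3, 4)    # 3 + 4i
z2 = complex(1, -2)   # 1 - 2i

# Norm and argument, and the polar form z = |z| e^{i theta}.
r, theta = abs(z1), cmath.phase(z1)
print(r)                                             # 5.0
print(cmath.isclose(r * cmath.exp(1j * theta), z1))  # True

# Inverse: conj(z) / |z|^2 multiplied by z gives 1.
inv = z1.conjugate() / abs(z1) ** 2
print(cmath.isclose(inv * z1, 1))                    # True

# Conjugate of a product, and norm of a product.
print((z1 * z2).conjugate() == z1.conjugate() * z2.conjugate())  # True
print(cmath.isclose(abs(z1 * z2), abs(z1) * abs(z2)))            # True

# The norm of a sum is generally NOT the sum of the norms.
print(abs(z1 + z2), abs(z1) + abs(z2))   # about 4.47 versus about 7.24
```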

Appendix 2: Moving waves 

Waves are all around us. Waves on water, sound, light, ... . In general, one can define
a wave as a disturbance, throughout some medium. What this disturbance is can
differ significantly: for water waves, it is a disturbance in the height of the surface, for
sound it is a disturbance in the pressure of the air, for light (see Appendix 4) it is a
disturbance in the electric and magnetic fields. If we denote the quantity that is disturbed
by $y$ then a wave is described by a function $y(x,t)$. Indeed, the disturbance typically
depends on the position $x$, and changes throughout the time $t$.⁵ A nice feature is
that for very, very many systems, 'small' waves (small disturbances) have the shape of
a sine-function.⁶ Superposing such sine-waves (=taking the sum, the total disturbance)
then gives a large class of possible disturbances traveling through space. So our central
question here is: How can we describe such a sine-shaped wave? Well, such a wave
can always be put into the form

$$y(x,t) = A\sin(kx - \omega t + \phi) \tag{1.18}$$

Let us dissect this expression. The right hand side indeed depends on the position $x$ and
time $t$. The other variables $A$, $k$, $\omega$, $\phi$ are constants. Since $A$ multiplies the sine-function,
it says how large the disturbance is, so $A$ is called the amplitude of the wave. To see the
meaning of the other constants, first note that at time $t = 0$, the wave looks like

$$y(x,t = 0) = A\sin(kx + \phi) \tag{1.19}$$



⁵ Possibly, the position is described by more than just one coordinate. You need two coordinates for a
wave on the 2 dimensional surface of water, 3 for a sound wave moving through a 3 dimensional room,
etcetera. For simplicity, we consider only one coordinate $x$ here. To avoid confusion, we stress again that
$y$ denotes the quantity being disturbed, it is not a spatial coordinate like $x$.

⁶ Or a cosine function, which is of course just the same shape, but shifted.

This is a sine-function indeed. The distance between two peaks is called the wavelength
$\lambda$, and is given by

$$\lambda = \frac{2\pi}{k}. \tag{1.20}$$

To check this, just note that if there is a peak at $x_1$, there will also be a peak at $x_1 + \lambda$,
since

$$y(x_1 + \lambda, 0) = A\sin(kx_1 + 2\pi + \phi) = A\sin(kx_1 + \phi) = y(x_1, 0). \tag{1.21}$$

The object k itself is called the wave number. What about </>? This constant determines 
how far the sine- function (|1.19|) is shifted to the left at t = 0. It is called the phase of the 
wave. Now let us take a different point of view. What if we stay put at x = 0, and watch 
how the disturbance at this specific position changes throughout time. Clearly, it is given 
by the function 

y{x = 0,t)=Asin{-ujt + (t)) (1.22) 

This looks very similar: again it is a harmonically oscillating behavior. Also, the distur- 
bance is periodic: it repeats itself throughout time. Indeed, the disturbance at t and t + ^ 
are the same: 

y{0,t + 2TT/uj) = Asin(-a;t + 27r + </>) = Asm{-ujt + (p) =y{0,t). (1.23) 

The time $T = 2\pi/\omega$ is called the period of the wave. The number $\omega$ is called the (angular)
frequency. Here is what happens when we look at the entire wave, evolving through time.
It starts off like the wave (1.19) at t = 0, and then it runs along the x-axis (maintaining its
shape) with a constant speed v. To see what the speed is, you can use the following trick.
To follow the wave on its movement, you should sit at one point (a peak for instance) and
ride along. To ride along means you always see the same disturbance along your way. So
you want to see y(x, t) to be constant, which is realized by $kx - \omega t$ = constant. Reworking
this expression, you get

$$x = \frac{\text{constant} + \omega t}{k} = \frac{\text{constant}}{k} + \frac{\omega}{k}\, t \tag{1.24}$$

This means you have to move at speed $\omega/k$. So the wave speed is given by $v = \omega/k$.
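The relations above (wavelength $\lambda = 2\pi/k$, period $T = 2\pi/\omega$, speed $v = \omega/k$) can be checked numerically. What follows is only a minimal sketch in Python: the values chosen for A, k, omega and phi are arbitrary illustrations and do not come from the text.

```python
import math

# Illustrative (assumed) wave parameters; any positive k and omega would do.
A, k, omega, phi = 1.5, 2.0, 3.0, 0.4

def y(x, t):
    """Sine-shaped wave y(x,t) = A sin(kx - omega t + phi), as in (1.18)."""
    return A * math.sin(k * x - omega * t + phi)

wavelength = 2 * math.pi / k      # (1.20)
period = 2 * math.pi / omega      # T = 2 pi / omega
speed = omega / k                 # v = omega / k

x0, t0 = 0.7, 1.1
# Shifting by one wavelength, or waiting one period, reproduces the disturbance.
same_after_wavelength = math.isclose(y(x0 + wavelength, t0), y(x0, t0), abs_tol=1e-12)
same_after_period = math.isclose(y(x0, t0 + period), y(x0, t0), abs_tol=1e-12)
# Riding along at speed v, the disturbance you see does not change.
dt = 0.25
rides_along = math.isclose(y(x0 + speed * dt, t0 + dt), y(x0, t0), abs_tol=1e-12)

print(wavelength, period, speed)
print(same_after_wavelength, same_after_period, rides_along)
```

Note that the speed is also the wavelength divided by the period: the wave advances exactly one wavelength during one period.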

Complex waves 

In quite some fields of physics, it turns out that the propagating waves in a system are 
most naturally described by a complex wave. This does not mean something wacky is 
going on, it just means this gives a convenient description, with simple equations etcetera. 
For example, in electrical circuits it can be handy to describe an alternating current by a 
complex wave. Maybe you have once seen this trick, and realized that such a description 
saves a lot of ugly trigonometric manipulations that would be needed when working with
ordinary (real) functions. However, if you really wanted to, you could do so, meaning that
nothing strange is going on. It is just a matter of choice. Real functions are easier to
understand, but complex functions save a lot of work.

Figure 1.3: A moving wave. The displacement y(x,t) is shown for times $t = 0$, $t = \delta t$ and
$t = 2\delta t$. Clearly, the wave is moving to the right with increasing time. The wave speed is
given by $v = \omega/k$.

Anyway, how would you describe such a complex wave? Well, very similar to the above: by a function



$$y(x, t) = A e^{i(kx - \omega t + \phi)} \tag{1.25}$$



Figure 1.4: A graphical representation of a complex wave $z = e^{ix}$. The real and imaginary
parts of z are shown separately (axes Re(f) and Im(f)). The total figure has a helix shape.



Actually, if we allow $A$ to be a complex number, we can absorb the phase $e^{i\phi}$ in it.
Such a wave can be visualized as in Figure 1.4. Note that the real and imaginary parts
are just ordinary waves of the type we just discussed. Again, $k$ is the wave number, $\omega$ the
(angular) frequency. Can we also describe a complex wave in three dimensions? Sure, this
would be given by

$$Y(x, y, z, t) = A e^{i(k_x x + k_y y + k_z z - \omega t + \phi)} = A e^{i(\vec{k} \cdot \vec{x} - \omega t + \phi)} \tag{1.26}$$

(We capitalized the name of the wave to avoid confusion.) In this case, the vector
$\vec{k} = (k_x, k_y, k_z)$ is called the wave vector. It can be shown that such a wave travels in the
direction of $\vec{k}$, with speed $v = \frac{\omega}{|\vec{k}|} = \frac{\omega}{\sqrt{k_x^2 + k_y^2 + k_z^2}}$.
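To make the link between complex and real waves concrete, here is a small Python sketch. It checks that the real and imaginary parts of the complex wave (1.25) are ordinary cosine and sine waves, and that the phase can be absorbed into a complex amplitude. The numerical values and the function names are assumptions made for illustration only.

```python
import cmath
import math

# Illustrative (assumed) parameters, not taken from the text.
A, k, omega, phi = 2.0, 1.3, 0.8, 0.5

def complex_wave(x, t):
    """y(x,t) = A exp(i(kx - omega t + phi)), as in (1.25)."""
    return A * cmath.exp(1j * (k * x - omega * t + phi))

# Absorb the phase factor exp(i phi) into a complex amplitude.
A_complex = A * cmath.exp(1j * phi)

def complex_wave_absorbed(x, t):
    """The same wave, written with a complex amplitude and no explicit phase."""
    return A_complex * cmath.exp(1j * (k * x - omega * t))

x0, t0 = 0.9, 2.4
z = complex_wave(x0, t0)
theta = k * x0 - omega * t0 + phi

real_is_cosine = math.isclose(z.real, A * math.cos(theta), abs_tol=1e-12)
imag_is_sine = math.isclose(z.imag, A * math.sin(theta), abs_tol=1e-12)
same_wave = cmath.isclose(z, complex_wave_absorbed(x0, t0), abs_tol=1e-12)
modulus_is_A = math.isclose(abs(z), A)

print(real_is_cosine, imag_is_sine, same_wave, modulus_is_A)
```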

To conclude, a remark about units. It is often stated that functions like the sine and 
exponential function can only have dimensionless arguments. Since x and t have units 
of length and time (meters and seconds), $k$ and $\omega$ necessarily have units m$^{-1}$ and s$^{-1}$.
With this, you can verify that the period $T$ and wavelength $\lambda$ of a wave (with the above
expressions) then have units time and length, as you would expect. 

Appendix 3: Maxwell's equations 

You have probably seen the laws of electromagnetism several times throughout your life. 
They just happen to be very important to a lot of interesting physical phenomena. Quite 
some physics books phrase these laws in terms of physical situations. Like: 'the magnetic 
field around a conductor carrying a current, is given by ...' or 'a changing magnetic 
flux will create an electric field inside the conductor, given by...'. These phrases are very 
important, but are not the most elegant description, since they require a lot of words and 
explanation. A quite nice fact is that all these laws can be summarized in a very compact 
way: only four equations. They are called Maxwell's equations, although they are only a 
different version of the laws discovered by others before him. They look as follows: 



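$$\nabla \cdot \vec{E} = \frac{\rho}{\epsilon_0} \qquad \text{(Gauss's law)}$$

$$\nabla \cdot \vec{B} = 0 \qquad \text{(Gauss's law for magnetism)}$$

$$\nabla \times \vec{E} = -\frac{\partial \vec{B}}{\partial t} \qquad \text{(Faraday's law)}$$

$$\nabla \times \vec{B} = \mu_0 \vec{J} + \mu_0 \epsilon_0 \frac{\partial \vec{E}}{\partial t} \qquad \text{(Ampere's law)}$$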



Here, $\vec{E}$ and $\vec{B}$ are the electric and magnetic fields, $\rho$ is the charge density, and $\vec{J}$ the
electrical current density. The constants $\epsilon_0$ and $\mu_0$ are the electrical permittivity and the magnetic
permeability. The object $\nabla$ is a vector of derivatives: $\nabla = \left(\frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}\right)$. So for example
$\nabla \cdot \vec{E} = \frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} + \frac{\partial E_z}{\partial z}$. So what do these equations say? First of all, note that the relevant
variables ($\vec{E}$, $\vec{B}$, $\rho$, $\vec{J}$) depend on the position x and time. So they are local equations:
they say how all involved quantities are related at every position and time. Also, the
equations involve derivatives of $\vec{E}$ and $\vec{B}$: the equations of Maxwell are four differential
equations. Sometimes, people say the above expressions are the 'differential' form of the 
laws of electromagnetism. It might not be immediately clear how to extract real physics 
(say, the force between two conductors carrying a current) from them, but sure they are 
very compact and elegant! That's why people like them so much. A very important 
consequence that can be extracted from them, is described in the following appendix. 
Appendix 4: Light is an electromagnetic wave

A question Maxwell asked himself, was the following. What happens if there are no charges 
or currents present? Could something interesting happen then? In that case ($\rho = 0$, $\vec{J} = 0$)
the four equations become: 

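$$\nabla \cdot \vec{E} = 0 \tag{1.27}$$

$$\nabla \times \vec{E} = -\frac{\partial \vec{B}}{\partial t} \tag{1.28}$$

$$\nabla \cdot \vec{B} = 0 \tag{1.29}$$

$$\nabla \times \vec{B} = \mu_0 \epsilon_0 \frac{\partial \vec{E}}{\partial t} \tag{1.30}$$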

Taking the curl (Vx) of the second and fourth equation and then using them again gives: 

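$$\nabla \times (\nabla \times \vec{E}) = -\frac{\partial}{\partial t} \nabla \times \vec{B} = -\mu_0 \epsilon_0 \frac{\partial^2 \vec{E}}{\partial t^2} \tag{1.31}$$

$$\nabla \times (\nabla \times \vec{B}) = \mu_0 \epsilon_0 \frac{\partial}{\partial t} \nabla \times \vec{E} = -\mu_0 \epsilon_0 \frac{\partial^2 \vec{B}}{\partial t^2} \tag{1.32}$$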

After this, using the vector identity 

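$$\nabla \times (\nabla \times \vec{V}) = \nabla (\nabla \cdot \vec{V}) - \nabla^2 \vec{V} \tag{1.33}$$

(valid for every vector function $\vec{V}(\vec{x})$) and the first and third equation ($\nabla \cdot \vec{E} = \nabla \cdot \vec{B} = 0$)
we get

$$\frac{\partial^2 \vec{E}}{\partial t^2} - c^2 \nabla^2 \vec{E} = 0 \tag{1.34}$$

$$\frac{\partial^2 \vec{B}}{\partial t^2} - c^2 \nabla^2 \vec{B} = 0 \tag{1.35}$$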



where $c = \frac{1}{\sqrt{\mu_0 \epsilon_0}}$. You may recognize that these equations are standard wave equations.
That means they have solutions with the shape of a sine-function, like $\sin(kx - \omega t)$. For
example, if one takes the electric field to be

$$\vec{E} = (0, E_0 \sin(kx - \omega t), 0), \tag{1.36}$$

then the x- and z-components of (1.34) are trivially solved, and the y-component becomes:

$$\omega^2 - c^2 k^2 = 0 \tag{1.37}$$

which can be satisfied by taking $\omega = ck$. From (1.28) and (1.30) we see that we should
take

$$\vec{B} = (0, 0, B_0 \sin(kx - \omega t)) \tag{1.38}$$
with $B_0 = E_0/c$. (Just check by plugging in that this works.) Clearly, the above also
satisfies the wave equation (1.35).

Figure 1.5: This figure shows an electromagnetic wave, with the electric and magnetic
fields along the x-axis shown at a specific instant in time. If we let time run, the wave
moves ('shifts') to the right.

So what does this look like? This solution to the
Maxwell equations is drawn in Figure 1.5. It is a wave of oscillating electric and magnetic
fields, traveling through space in the x-direction. Note that E and B are perpendicular to 
the direction of motion. We can intuitively understand why such a wave is possible. From 
electromagnetism, we know that a changing magnetic field induces a changing electric 
field, and vice versa. So the above wave precisely embodies this principle - even without 
any charges or currents being present. So the equations of Maxwell imply the existence of 
electromagnetic waves. That's nice, but there is more. The speed of this wave is given by 






$$v = \frac{\omega}{k} = c \tag{1.39}$$

and the numerical value of c is given by plugging in the values of the permittivity and
permeability:

$$c = \frac{1}{\sqrt{\mu_0 \epsilon_0}} = 2.998 \times 10^8 \text{ m/s} \tag{1.40}$$

That's precisely the numerical value of the speed of light! From this, Maxwell concluded
that light has to be a form of electromagnetic radiation. A very neat and surprising
conclusion, and - better even - very right. Today we know that light is only one form
of electromagnetic radiation. Microwaves, radio waves, X-rays, ... are all electromagnetic
waves, differing only in their frequency range.
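The value of c in (1.40) and the relation between the electric and magnetic amplitudes can be reproduced with a few lines of Python. This is a sketch under stated assumptions: the constants are rounded reference values for the vacuum permeability and permittivity, and the wave parameters E0 and k are arbitrary choices made here, not values from the text. The derivatives in Faraday's law are approximated by central finite differences.

```python
import math

# Vacuum constants in SI units (rounded reference values).
mu_0 = 1.25663706212e-6        # magnetic permeability, in N/A^2
epsilon_0 = 8.8541878128e-12   # electrical permittivity, in F/m

c = 1.0 / math.sqrt(mu_0 * epsilon_0)   # (1.40)

# Illustrative (assumed) plane wave travelling in the x-direction.
E0 = 1.0          # electric amplitude, in V/m
k = 1.0           # wave number, in 1/m
omega = c * k     # from (1.37)
B0 = E0 / c       # magnetic amplitude

def E_y(x, t):
    """y-component of the electric field (1.36)."""
    return E0 * math.sin(k * x - omega * t)

def B_z(x, t):
    """z-component of the magnetic field (1.38)."""
    return B0 * math.sin(k * x - omega * t)

# z-component of Faraday's law (1.28): dE_y/dx = -dB_z/dt,
# checked with central finite differences at one sample point.
x0, t0 = 1.0, 1.0e-9
hx = 1e-5 / k
ht = 1e-5 / omega
dEy_dx = (E_y(x0 + hx, t0) - E_y(x0 - hx, t0)) / (2 * hx)
dBz_dt = (B_z(x0, t0 + ht) - B_z(x0, t0 - ht)) / (2 * ht)
faraday_holds = math.isclose(dEy_dx, -dBz_dt, rel_tol=1e-6)

print(f"c = {c:.4e} m/s")
print(faraday_holds)
```

Choosing any other amplitude ratio than $B_0 = E_0/c$ makes the check fail, which is the numerical counterpart of "just check by plugging in".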



Appendix 5: What is interference? 

An important property of waves is that they can be put 'on top of each other'. Such a sum 
of two waves is called a superposition. This can be seen in water waves for example. If a 
duck on water creates waves $y_1(x, t)$ and another duck makes waves $y_2(x, t)$ then the total
disturbance pattern they create is given by $y_1(x, t) + y_2(x, t)$. A very interesting thing
that occurs when two disturbances are superposed is interference. It could be that at
some position and time, $y_1$ and $y_2$ are exactly opposite. This means the total disturbance
is zero, even if $y_1$ and $y_2$ individually would have been nonzero. Such a situation is called
destructive interference. Similarly, if $y_1$ and $y_2$ have the same sign, the total wave will
be larger than the individual disturbances. This is called constructive interference. You 
can easily reproduce these patterns yourself - by throwing two stones into an undisturbed 
pond for example. An experiment that makes constructive and destructive interference 
particularly clear is the double-slit experiment, shown in Figure 1.6. In this case,
incoming waves fall on two narrow slits/holes. At the other side of the slits, the waves
come out. In that region, the waves emerging from the two slits interfere. A question
worth asking is what the total wave pattern will look like on a wall at some distance from the
slits. For ease, we suppose the waves are of a single wavelength $\lambda$. Also, suppose the wall
is at a distance $L \gg \lambda$ from the slits. Since the waves start out the same way at the slits,
they must have the same phase there. This means that the disturbances at the two slits
show maximum positive displacement (a crest) and maximal negative displacement (a
trough) at the same moments in time. More precisely, the wave amplitude y(x,t) satisfies:

$$y(\text{slit 1}, t) = y(\text{slit 2}, t) \quad \forall t \tag{1.41}$$

Consider the pieces of wave traveling upwards, towards a position x on the screen. Note 
that the distance the two pieces of wave have to travel is not the same. This is called the 
path difference. From Figure 1.6 it is clear that the path difference $\Delta$ is given by

$$\Delta = d \sin\theta \tag{1.42}$$

Here is the crux of the story: if the path difference is a multiple of $\lambda$, the two waves will
have the same phase at x, so they will interfere constructively. If the path difference is
$\lambda/2$ plus a multiple of $\lambda$, the waves will be exactly opposite when arriving at x. So in that
case, destructive interference will occur. So

$$\Delta = n\lambda \quad (n \text{ integer}) \;\Rightarrow\; \text{constructive interference at } x \tag{1.43}$$

$$\Delta = \left(n + \tfrac{1}{2}\right)\lambda \quad (n \text{ integer}) \;\Rightarrow\; \text{destructive interference at } x \tag{1.44}$$

So combining (1.42) with (1.43), we see that constructive interference occurs if

$$d \sin\theta = n\lambda \tag{1.45}$$

Note that the position x on the wall is hidden in $\theta$. If L is large, $\theta$ is small and
$\sin\theta \approx \tan\theta = x/L$. In this approximation, the above condition can be rewritten as

$$x = \frac{n \lambda L}{d} \tag{1.46}$$

At points x satisfying the above relation, constructive interference occurs, so the combined
waves give rise to a wave with large amplitude. In between such points (at $x = (n + \tfrac{1}{2})\lambda L/d$)
destructive interference occurs - the waves of the lower and upper slit exactly
cancel each other. This means there is no disturbance at all at these points. In
between the two extremes, there is a smooth transition. This pattern shows alternating
bands (regions) of constructive and destructive interference, and is called the interference
pattern.

Figure 1.6: An interference experiment: two narrow slits/holes, through which waves fall.
The distance between the two holes is d. The distance between the holes and the wall
on the right is L. We are interested in the total (superposed) wave arriving at position
x, corresponding to an angle $\theta$ (shown at two places). The path difference is clearly
$\Delta = d \sin\theta$. On the right, the waves emerging from the two holes and traveling towards
one specific x are shown, at a particular instant in time. The waves emerging from
the holes necessarily have the same phase: here, they both show a crest at the leftmost
position. However, at the height of the wall, the two waves are out of phase. The
upper wave shows a trough, whereas the lower wave shows a crest. This means destructive
interference occurs at this particular x. This is not so surprising, since the path difference
is precisely half of the wavelength. (Shown in the right corner.)
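The fringe positions derived above can be turned into numbers. The sketch below is written in Python and uses an assumed setup (wavelength 500 nm, slit distance 0.1 mm, wall at 2 m) chosen only for illustration. The function `superposed_amplitude` adds two unit waves whose phases differ by $2\pi\Delta/\lambda$ (which follows from $k = 2\pi/\lambda$), using the small-angle approximation $\Delta \approx d\,x/L$.

```python
import math

# Illustrative (assumed) setup: two slits 0.1 mm apart, wall 2 m away.
wavelength = 500e-9   # lambda, in metres
d = 1e-4              # slit separation, in metres
L = 2.0               # distance from slits to wall, in metres

def bright_position(n):
    """Position of the n-th constructive fringe, x = n lambda L / d (1.46)."""
    return n * wavelength * L / d

def dark_position(n):
    """Position of the n-th destructive fringe, x = (n + 1/2) lambda L / d."""
    return (n + 0.5) * wavelength * L / d

def superposed_amplitude(x):
    """Amplitude of the sum of two unit waves arriving at position x,
    using the small-angle approximation sin(theta) ~ x / L."""
    path_difference = d * x / L
    phase_difference = 2 * math.pi * path_difference / wavelength
    # |1 + exp(i * delta)| = 2 |cos(delta / 2)|
    return 2 * abs(math.cos(phase_difference / 2))

for n in range(3):
    print(n, bright_position(n), dark_position(n))
print(superposed_amplitude(bright_position(3)), superposed_amplitude(dark_position(0)))
```

At the bright positions the two unit waves add up to amplitude 2, at the dark positions they cancel, and in between the amplitude varies smoothly.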



Chapter 2 

Schrodinger's equation 



In this chapter... 

The introductory chapter gave a quick qualitative view on one of the central theses of
quantum mechanics, namely that the fundamental building blocks of all matter and radiation
are tiny wave packets. As a consequence, we know that we should actually describe
particles as waves, not just as moving points. The question that now arises is obvious: how
can we do so? How do we describe such a wave? And what is the dynamics of such a wave;
how does it evolve in time? These questions will be addressed in this chapter. We will 
move on from a qualitative to a quantitative description. 

As mentioned in the foreword, all chapters will from now on have a fixed structure. 
The tool part of this chapter will be an introduction on the mathematics of operators. 
Then we tell the story of the Schrodinger equation. Finally, in the computation part we 
will do our first real quantum mechanical description of a particle. Good luck! 
2.1 Operators and functions

In this section, we will learn about operators. It will be a very central tool for the 
following chapters. First we explain what an operator is, then we give some examples. We 
end with the notion of an eigenvector and eigenvalue of an operator. 

2.1.1 An operator, what's that? 

About the most ubiquitous object in mathematics is the 'function'. Take for example the
real function $f(x) = x^2$. It maps each number to another number. More pictorially, we
could say that such a function is a machine: if you feed it a number $x$, it will return to you
a different number, $x^2$. Now what is an operator? It's a machine too, but now the in- and
output aren't numbers, but functions. Let's give a simple example. Given a nice smooth
function $f$, we can compute its derivative $f'$. We can see the process of differentiation as
a map:

$$\text{differentiation} : f \to f' \tag{2.1}$$

We can do this for every nice smooth function, so 'differentiation' is an operator: if you
feed it a function, it will return a new function, $f'$. If we denote this differentiation
operator with $D$, we have

$$D : f \to D(f) = f' \tag{2.2}$$

So $D(x^2) = 2x$, $D(e^x) = e^x$, etcetera. We can give another example of an operator: take
a number $n$, and define the operator $X^n$ as follows

$$X^n : f \to X^n(f) = x^n \cdot f \tag{2.3}$$

Since $X^n$ sends every function to a new function ($x^n$ times the original one) it's an operator
indeed. Take for example $X^2$. We have $X^2(\sin x) = x^2 \sin x$, $X^2(e^x) = x^2 e^x$, etcetera.^
Now both the above examples are of a special type: they are linear operators. We call 
an operator $L$ linear if for all functions $f$ and $g$ and for all numbers $a$ and $b$

$$L(af + bg) = aL(f) + bL(g). \tag{2.4}$$

This indeed holds for the differentiation operator $D$, and also for the operators $X^n$. In
fact, all operators we will meet from here on will be linear operators. They are the ones 
that are relevant for quantum mechanics. So, we will drop the adjective 'linear' from now 
on, and just talk about 'operators'. Also, we will try to stick to the following notation: 
a capital letter is an operator, while a minuscule denotes a function. (Numbers will be 
denoted by either a capital or a minuscule.) Besides that, we will also be more careful
with the differentiation operator, and denote with a subscript the variable with respect to
which the derivative is taken:

$$D_x = \frac{\partial}{\partial x} \tag{2.5}$$

and analogous for any other variable.

^For notation naggers: there is actually some abuse of notation here: we use $x^n$ and $\sin x$ as shorthand
for the functions $x \mapsto x^n$ and $x \mapsto \sin x$. In principle we could write out things more carefully, but this
would make all formulas look ugly.

Figure 2.1: A pictorial way to think about functions and operators. Both are machines.
A function has a number as in- and output. An operator takes a function as input and
gives another function in return.
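The "machine" picture translates quite literally into code: an operator is a function that takes a function and returns a function. The Python sketch below is one possible illustration, not part of the course material. The names `D` and `X` mirror the text, but the derivative is only approximated (by a central difference with an assumed step `h`), and the sample functions and numbers are arbitrary.

```python
import math

def D(f, h=1e-5):
    """Differentiation operator: takes a function f and returns (an
    approximation of) its derivative f', using a central difference."""
    return lambda x: (f(x + h) - f(x - h)) / (2 * h)

def X(n):
    """Operator X^n: returns a machine that multiplies a function by x^n."""
    return lambda f: (lambda x: x**n * f(x))

square = lambda x: x**2
d_square = D(square)           # approximately the function x -> 2x
x2_sin = X(2)(math.sin)        # the function x -> x^2 sin(x)

# Linearity check (2.4) at a sample point, for D and for X^2.
a, b, x0 = 3.0, -2.0, 0.7
combo = lambda x: a * math.sin(x) + b * math.exp(x)
lhs_D = D(combo)(x0)
rhs_D = a * D(math.sin)(x0) + b * D(math.exp)(x0)
lhs_X = X(2)(combo)(x0)
rhs_X = a * X(2)(math.sin)(x0) + b * X(2)(math.exp)(x0)

print(d_square(1.5), x2_sin(2.0))
print(lhs_D, rhs_D, lhs_X, rhs_X)
```

Both sides of the linearity condition agree up to rounding, as they should for linear operators.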



2.1.2 Eigenfunctions and eigenvalues. 

If you had any course on linear algebra, you are probably familiar with the notion of an 
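eigenvector. Given a matrix $A$, we say $v$ is an eigenvector with eigenvalue $\lambda$ if

$$Av = \lambda v. \tag{2.6}$$

(Being lazy, we didn't put an arrow on the vector $v$.) For example:

$$\begin{pmatrix} 2 & 0 \\ -2 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ -2 \end{pmatrix} = 2 \begin{pmatrix} 1 \\ -2 \end{pmatrix} \tag{2.7}$$

So the vector $\begin{pmatrix} 1 \\ -2 \end{pmatrix}$ is an eigenvector of the matrix $\begin{pmatrix} 2 & 0 \\ -2 & 1 \end{pmatrix}$ with eigenvalue 2.
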

However simple the idea of eigenvector is, it is the basic concept for a lot of interesting
math. Not surprisingly, it will turn out to be a very useful step to generalize it to the
context of operators. There, the input and output are functions, not vectors. So it could
be the case that for a certain function $f$ and an operator $A$:

$$A(f) = \lambda f. \tag{2.8}$$
with $\lambda$ a number. In that case, we say that the function $f$ is an eigenfunction of the operator
$A$, with eigenvalue $\lambda$. If $A$ is a linear operator (and all operators we will meet from here
on will be so) then every multiple of an eigenfunction is an eigenfunction again. Indeed,
in the case of a linear operator $A$ and an eigenfunction $f$, we have

$$A(kf) = kA(f) = k\lambda f = \lambda (kf) \tag{2.9}$$

for every number $k$.

Figure 2.2: An example of two functions $f_1$ and $f_2$, acted upon by an operator $A$. Function
$f_1$ is sent to a multiple of itself: it is an eigenfunction of the operator $A$. Function $f_2$ is
sent to something completely different, and clearly is not an eigenfunction of $A$.

The parallel with ordinary eigenvectors in linear algebra is very
strong. To make the resemblance even more explicit, we can rewrite the condition to be
an eigenfunction in a slightly shorter way:

$$Af = \lambda f. \tag{2.10}$$



Here it is understood that the operator A works on the object to its right - just like a 
matrix working on a vector. From now on we will always use this notation. So if we don't 
put brackets around its argument explicitly, it is understood that an operator works 
on the object standing to its right. So for example we may simply write ABf instead 
of A{B{f)). At first sight, it may look like this could give rise to ambiguous expressions, 
but just like with matrix products, the notation works quite well. For example, given 
two operators A and B we can define a new operator, denoted by AB, which acts on any 
function with B first, and then with A: 



$$AB : f \to (AB)(f) = A(B(f)) \tag{2.11}$$

Let's do some random examples. The function $e^{7x}$ is an eigenfunction of $D_x$, with eigenvalue
7, because

$$D_x e^{7x} = 7 e^{7x} \tag{2.12}$$
Another one: $\sin(x)$ is an eigenfunction of the operator $D_x D_x$, with eigenvalue $-1$, because

$$D_x D_x \sin(x) = D_x(D_x(\sin(x))) = D_x(\cos(x)) = -\sin(x). \tag{2.13}$$

In the first step we have added brackets, to make the connection with the previous notation.
Another example. Define the unity operator '1' as the operator that sends each function
to itself:

$$1 : f \to f \tag{2.14}$$

(In fact, this is just the operator $X^0$.) It is obvious that all functions are eigenfunctions of
the unity operator, with eigenvalue 1. Last example: take a function depending on some
variables, but not on x. So, for example a function $g(y, z)$. If we take the derivative with
respect to x, we get

$$D_x g(y, z) = \frac{\partial g(y, z)}{\partial x} = 0 \tag{2.15}$$

So any such function $g(y, z)$ is an eigenfunction of $D_x$, with eigenvalue 0.
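One way to test numerically whether a function is an eigenfunction is to look at the ratio (Af)(x)/f(x): for an eigenfunction this ratio is the same number at every point where f is nonzero. The Python sketch below does this for the examples above. The helper name `eigenvalue_estimate`, the step size and the sample points are assumptions of this illustration, and the derivative is a finite-difference approximation.

```python
import math

def D(f, h=1e-4):
    """Central-difference approximation of the derivative operator D_x."""
    return lambda x: (f(x + h) - f(x - h)) / (2 * h)

def eigenvalue_estimate(operator, f, x):
    """If operator(f) = lambda * f, then operator(f)(x) / f(x) equals lambda
    at every x where f(x) is nonzero."""
    return operator(f)(x) / f(x)

exp7 = lambda x: math.exp(7 * x)
DD = lambda f: D(D(f))          # the product operator D_x D_x

sample_points = [0.1, 0.5, 1.2]
ratios_exp = [eigenvalue_estimate(D, exp7, x) for x in sample_points]
ratios_sin = [eigenvalue_estimate(DD, math.sin, x) for x in sample_points]
# x^2 is not an eigenfunction of D_x: the ratio 2x / x^2 depends on x.
ratios_square = [eigenvalue_estimate(D, lambda x: x**2, x) for x in sample_points]

print(ratios_exp)
print(ratios_sin)
print(ratios_square)
```

The first two lists are (nearly) constant, close to 7 and to -1 respectively; the third one changes from point to point, so no single eigenvalue exists.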

Sums of operators 

A last remark: we can also add operators, and multiply them by numbers, in a quite
trivial way. The operator $A + B$ is defined as follows: it sends every function $f$ to $Af + Bf$.
In the same way, for any number $c$, the operator $cA$ is defined as follows: it sends every
function $f$ to $cAf$ (= $c$ times the function $Af$). As an application of these rules, you can
check that the operator $X^3 - 2X^2$ acts as:

$$X^3 - 2X^2 : f \to (x^3 - 2x^2) \cdot f \tag{2.16}$$

just like you would guess naively. In general, to any function $g(x)$ we can associate the
operator $g(X)$, defined as follows:

$$g(X) : f \to g(x) \cdot f \tag{2.17}$$

So for example $\exp(X)\, x = e^x x$. That's the math for this chapter, let's move on to the
story part!
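Before moving on, here is a short Python sketch of the rules for sums and multiples of operators. The helper names `add`, `scale` and `multiply_by` are made up for this illustration, and the exponents and sample point are arbitrary choices.

```python
import math

def add(A, B):
    """Operator A + B: sends f to the function Af + Bf."""
    return lambda f: (lambda x: A(f)(x) + B(f)(x))

def scale(c, A):
    """Operator cA: sends f to c times the function Af."""
    return lambda f: (lambda x: c * A(f)(x))

def multiply_by(g):
    """Operator g(X): sends f to the function g(x) * f(x), as in (2.17)."""
    return lambda f: (lambda x: g(x) * f(x))

X2 = multiply_by(lambda x: x**2)
X3 = multiply_by(lambda x: x**3)

# Exponents chosen for illustration: X^3 - 2 X^2 should act like g(X)
# with g(x) = x^3 - 2 x^2.
combined = add(X3, scale(-2.0, X2))
direct = multiply_by(lambda x: x**3 - 2 * x**2)

x0 = 1.7
value_combined = combined(math.sin)(x0)
value_direct = direct(math.sin)(x0)

# exp(X) acting on the function x gives e^x * x.
identity_function = lambda x: x
exp_X_on_x = multiply_by(math.exp)(identity_function)

print(value_combined, value_direct)
print(exp_X_on_x(2.0), 2.0 * math.exp(2.0))
```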

2.2 Wavefunctions and Schrodinger

2.2.1 The wave function 

In the previous chapter, we saw how the two-slit experiment and the photoelectric effect 
force us to describe both radiation and matter as wave packets. Especially for particles 
this requires a new framework. From here on, our main goal will be to answer the question: 
how can we describe the wave aspect of particles? Here particle can mean: electron, 
proton, neutron, or any other of the (more rare) particles that make up our universe. Let's 
take for example one single particle at a specific moment in time t. The pioneers of quantum
mechanics took the following (very reasonable) step: let us describe this particle by a
function:

$$\psi(x) \tag{2.18}$$

The symbol $\psi$ is the Greek letter psi, pronounced as 'sigh'. Like for a wave, it gives the
disturbance as a function of space. As we want to describe a wave packet, we expect $\psi$ to
fall off fast outside some region which is the location of the wave packet. Schematically,
you might see something like this

[Figure: a wave packet. Vertical axis: disturbance $\psi$; horizontal axis: spatial direction.]



For simplicity we have drawn only one coordinate axis. In principle the disturbance
$\psi$ will depend on the three different coordinates, but for now we will describe disturbances
depending on one single coordinate. So a wave packet is just a function $\psi(x)$. But then
what is that disturbance $\psi$? For radiation, there are two oscillating fields: E and B. For
a sound wave, there is a space- (and time-) dependent perturbation of the pressure and the
density. Here however, we are dealing with a fundamentally new concept: for sure $\psi$ is
not a type of disturbance that you have met before. So, since a better name is lacking, $\psi$
is simply called the wave function of the particle. After all, it is a function describing 
the particle's wave packet. A peculiar thing is that the wave function is taken to be a 
complex function. 



2.2.2 Why complex? 

Notice that in the examples just given (radiation and sound waves) there were several 
oscillating quantities. This is very general: most waves consist of two types of disturbance, 
which are constantly being transformed into each other, at each point in space. For 
electromagnetism, a changing electric field creates the magnetic field, and the changing 
magnetic field creates the electric field again, and so on - and this happens at each point 
in space. For a wave of water, there is the movement (velocity) of water on one hand 
and the vertical displacement on the other hand. Again, these two types of disturbance 
are constantly transformed into each other. So many waves are described by two real
functions (disturbances). This already suggests it is possible to group these together into 
one single complex function. 

There is another argument to do this 'grouping'. As we have stressed, for each position 
in space, the wave exhibits a cyclic behavior. The specific stage of the cycle can be 
expressed by a phase (an angle). On the other hand, a wave often has a notion of total 
amplitude (a total of the oscillating quantities). As you know, a phase and an amplitude 
can be economically summarized in a complex number, via the argument and the norm, 
of the complex number. 

So this suggests one can economically describe a wave by a single complex function. 
Indeed, in electromechanics, for example, alternating currents are often described by complex
waves. This is not for fun, or to increase the complexity of the situation. There,
such a description really is quite natural and simplifies a lot of computations. Indeed:
wave phenomena naturally involve trigonometric functions (cos x, sin x, ...) and instead of
using tedious trigonometric identities, one can see these functions as the real/imaginary
part of a single complex function like $e^{ix}$. (See Appendix 2 of the previous chapter.) For
water waves and pure electromagnetism, a complex description of waves is less usual. One 
reason is that there, the constituents of these waves have a very clear physical meaning 
(electric and magnetic fields, displacement of water, ...) and people like to hold on to 
those concepts. Of the many scientific fields in which people deal with waves, some use 
complex descriptions, some don't. It's a bit of a trade-off. A complex description can lead 
to easier expressions (sometimes, not always) but is of course slightly less intuitive. 

Back to quantum mechanics and the wave function of a single particle. There, the 
disturbance is not of a kind we know already. So there is no point in trying to give a
description in terms of real fields - this would just make the mathematical expressions
look more ugly. This suggests one should just go ahead and use a complex function to
describe these wave packets. And indeed you will see this works pretty well. Just recall
that there is nothing fancy about using complex functions: it does not mean some intrinsic
strangeness is involved. Complex functions are just the most natural way to deal with 
waves - that is why we like to use them. 

2.2.3 Time evolution 

We now know we should describe a particle by a complex function, the wave function. The 
next question is obvious: how does such a wave function evolve in time? Otherwise stated, 
say you have a wave function at a given time $t_0$, call this $\psi(x, t_0)$, how will it move/change
its shape as time runs? So what is $\psi(x, t)$ for later times t? A solution to this problem
would be to find a relation of the form

$$\partial_t \psi(x, t) \text{ in terms of } \psi(x, t) \tag{2.19}$$

In other words, we want to find a differential equation relating the state $\psi(x, t)$ of the
wave function to its change $\partial_t \psi(x, t)$ in time.
To get a hint

To get a hint what such a relation might look like, let's first try to generalize an (infinite)
real wave to the complex description we will use. An infinite wave with amplitude $A$,
frequency $\omega$ and wave number $k$ is typically described by

$$A \sin(kx - \omega t). \tag{2.20}$$

Indeed, this is an oscillation, moving to the right with speed $v = \omega/k$. Now the complex
function

$$A e^{i(kx - \omega t)} \tag{2.21}$$

is an oscillation as well. Take some time to try to visualize this wave. At each point in
space the value of the function is a complex number, with modulus $A$, and with a phase
that evolves with angular frequency $\omega$. So the above function describes a complex wave
with angular frequency $\omega$, wave number $k$, and amplitude $A$. If you draw the real and
imaginary parts separately, you get a helix shape, like in the following figure:

Figure 2.3: The complex wave $f(x) = e^{ix}$, showing its real and imaginary parts (Re(f) and Im(f)) separately.



Throughout time, the wave (2.21) will move: the above helix will run to the left or to
the right. Also, note that (2.21) is an eigenfunction of the operator

$$D_t = \frac{\partial}{\partial t} \quad \text{with eigenvalue } -i\omega \tag{2.22}$$

and of

$$D_x = \frac{\partial}{\partial x} \quad \text{with eigenvalue } ik. \tag{2.23}$$
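These two eigenvalue statements are easy to check numerically. In the Python sketch below, central finite differences stand in for the operators $D_t$ and $D_x$; the values of A, k and omega, the step size h and the sample point are assumptions chosen for illustration.

```python
import cmath

A, k, omega = 1.0, 2.5, 4.0   # illustrative (assumed) values

def wave(x, t):
    """Complex wave A exp(i(kx - omega t)), as in (2.21)."""
    return A * cmath.exp(1j * (k * x - omega * t))

h = 1e-6
x0, t0 = 0.3, 0.8

# Central differences stand in for the operators D_t and D_x.
Dt_wave = (wave(x0, t0 + h) - wave(x0, t0 - h)) / (2 * h)
Dx_wave = (wave(x0 + h, t0) - wave(x0 - h, t0)) / (2 * h)

eigenvalue_t = Dt_wave / wave(x0, t0)   # close to -i * omega
eigenvalue_x = Dx_wave / wave(x0, t0)   # close to  i * k

print(eigenvalue_t, eigenvalue_x)
```

Repeating the computation at another point (x0, t0) gives the same two ratios, which is exactly what being an eigenfunction means.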



Loosely speaking, the values of $\omega$ and $k$ can be obtained by acting with the time- and spatial
derivative on the wave. Moreover, we know from the first chapter that the frequency and
wave number of a particle are related to the energy and momentum,

$$E = \hbar\omega \qquad p = \hbar k \tag{2.24}$$

Combining with the above, we expect 'some' relation between the quantities E and p of
the particle and operators acting on its wave function:

$$D_t \leftrightarrow -i\omega = -i\frac{E}{\hbar} \quad \text{and} \quad D_x \leftrightarrow ik = i\frac{p}{\hbar} \tag{2.25}$$
On the other hand, we know from classical mechanics that the total energy E and momentum
p (= mv) of a particle are related by the following expression:

$$E = V(x) + \frac{mv^2}{2} = V(x) + \frac{p^2}{2m}, \tag{2.26}$$

where V(x) is the potential energy of the particle at position x.

A guess 

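Now, using (2.25) to replace E and p in (2.26) we get

$$i\hbar D_t \;\text{"="}\; V(x) + \frac{1}{2m}(-i\hbar)^2 D_x^2. \tag{2.27}$$

Of course, this equation looks a bit strange. However, if we let both sides act on $\psi$, we
get the following

$$i\hbar D_t \psi(x, t) = V(x)\psi(x, t) + \frac{(-i\hbar)^2}{2m} D_x^2 \psi(x, t) \tag{2.28}$$

which can be rewritten as:

$$i\hbar \frac{\partial \psi(x, t)}{\partial t} = V(x)\psi(x, t) - \frac{\hbar^2}{2m} \frac{\partial^2 \psi(x, t)}{\partial x^2} \tag{2.29}$$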

This expression is called the Schrodinger equation, after the physicist Erwin Schrodinger
who wrote it down first.^ It has the shape of an evolution equation: it tells you what the
time evolution ($\partial\psi/\partial t$) of a wave function is, given the wave function at that moment. That
is precisely what we were looking for! It turns out that almost all particles obey the above
time evolution equation: electrons, protons, neutrons, ... . Hence, the above equation is
really the core of quantum mechanics - much of the rest of the course will be using this
equation.
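As a sanity check on the guess, one can verify numerically that a complex plane wave solves the equation for a free particle (V = 0) precisely when its frequency and wave number satisfy $\hbar\omega = (\hbar k)^2/2m$, which is the classical relation $E = p^2/2m$ in disguise. The Python sketch below works in units with hbar = m = 1 (an assumption made to keep the numbers simple) and replaces the derivatives by finite differences; the function name `residual` and all numerical values are choices of this illustration.

```python
import cmath

# Units with hbar = m = 1 are an assumption made to keep the numbers simple.
hbar, m = 1.0, 1.0
k = 1.5                                  # illustrative wave number

def residual(omega, x=0.4, t=0.9, h=1e-4):
    """Size of  i hbar d_t psi - [ -hbar^2/(2m) d_x^2 psi ]  for the plane wave
    psi = exp(i(kx - omega t)) with V = 0, using finite differences."""
    psi = lambda x, t: cmath.exp(1j * (k * x - omega * t))
    dpsi_dt = (psi(x, t + h) - psi(x, t - h)) / (2 * h)
    d2psi_dx2 = (psi(x + h, t) - 2 * psi(x, t) + psi(x - h, t)) / h**2
    return abs(1j * hbar * dpsi_dt + hbar**2 / (2 * m) * d2psi_dx2)

omega_matching = hbar * k**2 / (2 * m)   # from hbar*omega = (hbar*k)^2 / (2m)
residual_good = residual(omega_matching)
residual_bad = residual(2 * omega_matching)

print(residual_good, residual_bad)
```

With the matching frequency the two sides of the equation agree up to discretization error; with a deliberately wrong frequency they clearly do not.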

Disclaimer 

If you have vaguely understood the motivation and arguments leading to the Schrodinger 
equation, do congratulate yourself. Do not think that the arguments presented are even
close to being solid though: we did not derive the equation. Such a derivation doesn't
even exist. To see why, just think back to how the laws of Newton were introduced to
you a long time ago. People can try to convince you that those formulas are more or less
reasonable, but their exact form cannot be derived in a mathematical way: they are only
found by doing lots of experiments, by many people thinking long and hard about how 
to put the experimental results into a small set of formulas. Newton did this for gravity 
and classical mechanics (building on the work of many others), and Schrodinger (equally 
dependent on others' ideas and results) did so for quantum mechanics. 

Of course, things will get more clear after using the Schrodinger equation. Good news: 
in the next section we will do our first real quantum mechanics computation. 



^Pronounced 'SHROEding-uh' 
2.3 Using the Schrodinger equation

2.3.1 The Hamiltonian 

Let's look at the Schrodinger equation more carefully. On the right hand side there is

$$V(x)\psi(x, t) - \frac{\hbar^2}{2m} \frac{\partial^2 \psi(x, t)}{\partial x^2} \tag{2.30}$$

This whole object is a function, depending on x and t. If you know V(x), and someone
gives you some input $\psi(x, t)$, you can readily compute the above object for them. Wait
wait wait, we have heard that before: this sounds like an operator! Indeed, using the notation
of the first section (2.1), we can define the operator

$$H = V(X) - \frac{\hbar^2}{2m} D_x^2 \tag{2.31}$$

which sends any function to V(x) times itself, plus $-\frac{\hbar^2}{2m}$ times its second derivative. This
operator is called the Hamiltonian operator. Using this operator, we can then rewrite
the Schrodinger equation in the following concise form:

$$i\hbar \frac{\partial \psi(x, t)}{\partial t} = H\psi(x, t) \tag{2.32}$$

Some people like to write partial derivatives like $\frac{\partial}{\partial t}$ and $\frac{\partial}{\partial x}$ simply as $\partial_t$ and $\partial_x$. They can
stylize even further:

$$i\hbar \partial_t \psi = H\psi. \tag{2.33}$$

In this form the Schrodinger equation looks less scary - although the notation is more 
abstract. 
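The Hamiltonian can be written as a "machine" in the same way as the operators of section 2.1. The Python sketch below is only an illustration: it assumes units with hbar = m = 1, approximates the second derivative by a finite difference, and uses a constant potential V0 and a plane wave as test input, none of which are prescribed by the text.

```python
import cmath

hbar, m = 1.0, 1.0            # assumed units with hbar = m = 1

def hamiltonian(V, h=1e-4):
    """Return the operator H = V(X) - hbar^2/(2m) D_x^2 as a Python function
    that maps a wave function psi(x) to the new function (H psi)(x)."""
    def H(psi):
        def H_psi(x):
            second_derivative = (psi(x + h) - 2 * psi(x) + psi(x - h)) / h**2
            return V(x) * psi(x) - hbar**2 / (2 * m) * second_derivative
        return H_psi
    return H

V0 = 0.7                                  # a constant potential, chosen for illustration
H = hamiltonian(lambda x: V0)

k = 2.0
plane_wave = lambda x: cmath.exp(1j * k * x)
x0 = 0.3
ratio = H(plane_wave)(x0) / plane_wave(x0)     # close to V0 + hbar^2 k^2 / (2m)
expected = V0 + hbar**2 * k**2 / (2 * m)

print(ratio, expected)
```

For a constant potential, the plane wave comes back multiplied by a number, potential energy plus $\hbar^2 k^2 / 2m$, in line with $E = V + p^2/2m$ and $p = \hbar k$.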

2.3.2 A real computation: a particle in a box 

We are now ready for our first real computation. Imagine a particle enclosed in a small
space, or a "box" - for instance an electron trapped in a small region. What does its
wave function look like, and how does it evolve in time? Let's see if we can answer these
questions. First, let's make the word "box" more precise. Consider the
following potential

$$V(x) = \begin{cases} 0 & \text{for } 0 < x < L \\ +\infty & \text{everywhere else} \end{cases} \tag{2.34}$$

Between $x = 0$ and $x = L$, the potential is constant, so the particle moves freely. Outside
that region however, the potential energy is infinite, meaning the particle cannot go
there. In terms of the wave function: we impose that $\psi$ has to be zero outside $[0, L]$.
If we also demand the wave function to be continuous, it also has to vanish at $x = 0$
and $x = L$.

[Figure: the box potential, with $V(x) = \infty$ (barrier) outside the box and $V(x) = 0$ (well) inside.]

Since outside the box $\psi = 0$, the Schrödinger equation is trivially satisfied
there, so we can focus on solving the equation inside the box. For a general initial wave
function $\psi(x) = \psi(x, t = 0)$ it is not so clear how to explicitly solve the Schrödinger
equation. So it is hard to tell what the wave function will look like at a later moment.
However, something special happens if the initial wave function is an eigenfunction of the
Hamiltonian operator. In that case,



$$H\psi(x) = E\,\psi(x) \tag{2.35}$$

for some number $E$. In that case, the time evolution of the wave packet is rather easy: it
is given by

$$\psi(x,t) = e^{-iEt/\hbar}\,\psi(x) \tag{2.36}$$



You can easily check that the above $\psi(x,t)$ indeed satisfies the Schrödinger equation
$i\hbar\,\partial_t \psi(x,t) = H\psi(x,t)$. So for each wave function that is an eigenfunction of the
Hamiltonian, we can write down a fully time-dependent solution to the Schrödinger equation.
For this reason, (2.35) is called the time-independent Schrödinger equation: it's an
equation not involving $t$, and the time evolution of its solutions is very easy. Let's try to
solve that equation for a particle in a box. Here $V(x) = 0$ in the relevant region, so that

$$H = -\frac{\hbar^2}{2m} D_x^2 = -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} \tag{2.37}$$

Can we find eigenfunctions of this operator? Well, you know that the functions $\sin x$ and
$\cos x$ are proportional to their second derivative. This suggests we should look in that
direction. Putting in some extra constants, we find that all functions of the form

$$\psi(x) = A\sin(kx) + B\cos(kx) \tag{2.38}$$

satisfy $H\psi = E\psi$, given that

$$E = \frac{\hbar^2 k^2}{2m} \tag{2.39}$$
Only the function $\sin(kx)$ vanishes at $x = 0$, and demanding that also $\psi(L) = 0$ imposes

$$k = \frac{n\pi}{L} \quad \text{with } n \text{ a positive natural number.} \tag{2.40}$$

So the eigenfunctions of $H$ (satisfying the boundary conditions $\psi(0) = \psi(L) = 0$) are
given by

$$\psi_n(x) = A\sin\left(\frac{n\pi x}{L}\right) \tag{2.41}$$

and they have corresponding eigenvalues

$$E_n = \frac{\hbar^2 k^2}{2m} = \frac{n^2\pi^2\hbar^2}{2mL^2} \tag{2.42}$$

A priori, the prefactors $A$ have arbitrary values. For reasons that will be clear later, we
will put the $A$'s equal to $\sqrt{2/L}$. If we now use (2.36), we see that the full time-dependent
solutions to the Schrödinger equation are

$$\psi_n(x,t) = e^{-iE_n t/\hbar}\,\sqrt{\frac{2}{L}}\,\sin\left(\frac{n\pi x}{L}\right) \tag{2.43}$$

The first few $\psi_n$ are shown in the figure below.




The time-dependent solutions are just the above graphs, but 'rotating' in time: multiplying
by $e^{-iE_n t/\hbar}$ keeps the norm of the wave function the same throughout time, but
the argument runs with angular velocity $\omega = E_n/\hbar$. This is a pretty nice result. We find
an infinite series of allowed wave functions for a particle in a box. The solutions are labeled
by the integer $n$, and each such solution is an eigenfunction of the Hamiltonian, with
eigenvalue $E_n$. Let's try to understand this result better.
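The claim that each $\psi_n$ is an eigenfunction of $H$ with eigenvalue $E_n$ can also be checked numerically. The following is a minimal sketch, not part of the original course: it assumes an electron in a box, approximates the second derivative by central differences, and all function names are invented for this illustration.

```python
import cmath
import math

HBAR = 1.054571817e-34   # reduced Planck constant in J*s
M_E = 9.1093837015e-31   # electron mass in kg (assumed particle for this sketch)


def box_energy(n, length, mass=M_E):
    """Energy E_n = n^2 pi^2 hbar^2 / (2 m L^2) of level n, in joules."""
    return (n * math.pi * HBAR) ** 2 / (2.0 * mass * length ** 2)


def psi_n(n, x, length):
    """Normalized eigenfunction sqrt(2/L) sin(n pi x / L) inside the box."""
    return math.sqrt(2.0 / length) * math.sin(n * math.pi * x / length)


def hamiltonian_on(n, x, length, mass=M_E):
    """Apply H = -(hbar^2 / 2m) d^2/dx^2 to psi_n at x by central differences."""
    dx = length * 1e-4
    second = (psi_n(n, x + dx, length) - 2.0 * psi_n(n, x, length)
              + psi_n(n, x - dx, length)) / dx ** 2
    return -HBAR ** 2 / (2.0 * mass) * second


def eigen_residual(n, length, samples=50):
    """Largest mismatch between H psi_n and E_n psi_n, relative to E_n * max|psi_n|."""
    energy = box_energy(n, length)
    scale = energy * math.sqrt(2.0 / length)
    worst = 0.0
    for j in range(1, samples):
        x = j * length / samples
        diff = hamiltonian_on(n, x, length) - energy * psi_n(n, x, length)
        worst = max(worst, abs(diff) / scale)
    return worst


def phase_factor(n, t, length):
    """Time-evolution factor exp(-i E_n t / hbar) from (2.36)."""
    return cmath.exp(-1j * box_energy(n, length) * t / HBAR)
```

For a 10 nm box, `eigen_residual(1, 1e-8)` comes out tiny, which is the numerical version of $H\psi_n = E_n\psi_n$, and `abs(phase_factor(n, t, 1e-8))` stays equal to 1 for any $t$: the time evolution only rotates the phase.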
Interpretation

It is very striking that only a discrete set of wave functions (obeying the Schrödinger
equation and the boundary conditions) can be found. By discrete we mean the set of
solutions is labeled by an integer, not by a continuous parameter. Each such solution is
called a state. So a particle in a box can only be in a discrete series of states. As we
saw before, the Hamiltonian operator is closely related to the concept of energy, so it is
natural to see the eigenvalue of a state under $H$ as its energy. So according to quantum
mechanics, the energy of a particle in a box can only take on a discrete set of values $E_n$.

Let's compare this to the classical (incorrect) description. If a fundamental particle
really were a point, it would just look like a bullet bouncing from side to side in the box,
with speed $v = p/m$. The states would then just be labeled by the momentum $p$ of the
particle. So in such a situation, any value of $p$ would be allowed, and the energy $E = \frac{p^2}{2m}$
of the particle could take on any positive value.

This means the classical (point particle) description is drastically different from the 
(correct) quantum description. How can we check the quantum description is really true? 
Imagine a particle jumping from one energy level to a lower one. In such a process, the 
energy difference has to be released - for example in the form of radiation: the emission 
of a photon. The energy of such a released photon can be measured. This way, we can 
measure the energy differences between the different states of a system. Sure enough, 
for particles enclosed in a small region these energy differences take on only very special 
values, confirming the spectrum is actually discrete, just like QM predicts. Of course, the 
details of that spectrum depend on the exact shape of the potential in which the particle 
is locked up. The potential discussed here is a bit simplistic and idealized. In reality there is more than
one dimension (here we considered only an $x$-dependent wave function) and it is technically
not possible to build such a perfect box potential. However, very similar systems can be
built. A quantum dot, for example, is a configuration in which an electron is locked up in a very
small zone of a material. These dots can have, for example, a nice round shape. The calculation to
solve the Schrödinger equation is a bit more difficult there, but the result is very similar:
one finds a discrete series of states, each with its own energy. And the computed energy
levels indeed match experimental results, backing up the quantum description once again.
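To get a feeling for the numbers involved in such a measurement, the level formula (2.42) can be combined with the photon relation $E = hf$. The sketch below is only an illustration: the 1 nm box size and the function names are assumptions chosen for this example, not values taken from the course.

```python
import math

H_PLANCK = 6.62607015e-34            # Planck constant in J*s
HBAR = H_PLANCK / (2.0 * math.pi)    # reduced Planck constant in J*s
M_E = 9.1093837015e-31               # electron mass in kg
EV = 1.602176634e-19                 # one electronvolt in joules


def level_energy(n, length, mass):
    """E_n = n^2 pi^2 hbar^2 / (2 m L^2), in joules."""
    return (n * math.pi * HBAR) ** 2 / (2.0 * mass * length ** 2)


def transition_frequency(n_high, n_low, length, mass):
    """Frequency f = (E_high - E_low) / h of the emitted photon, in Hz."""
    delta = level_energy(n_high, length, mass) - level_energy(n_low, length, mass)
    return delta / H_PLANCK


if __name__ == "__main__":
    box = 1e-9  # hypothetical box of 1 nm
    print("E_1 in eV:", level_energy(1, box, M_E) / EV)
    print("f(2 -> 1) in Hz:", transition_frequency(2, 1, box, M_E))
```

For this hypothetical 1 nm box the lowest level lies at roughly 0.38 eV. Since $E_n \propto n^2$, the photon emitted in the jump from level 2 to level 1 carries exactly three times that energy.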
Exercises

1. In the beginning of the chapter, we promised that all operators relevant to QM are 
linear operators. Check that the Hamiltonian operator is linear, by explicitly writing 
out its action on a linear combination of functions. 

2. What is the eigenvalue of the function $f(x) = 1$ under the operator $D_x$? And of the
function $f(x) = x$ under the operator $X D_x$? Give an example of a function which
is not an eigenfunction of these operators (should be easy).

3. Under what condition (on $n$ and $m$) is the function $x^n$ an eigenfunction of $X^m D_x$?
What's the corresponding eigenvalue?

4. Consider an electron locked up in a box, with length $L = 10$ nm (1 nm $= 10^{-9}$ m, the
electron mass is $m_e = 9.11 \cdot 10^{-31}$ kg). What is the energy $E_2$ of the second level?
And what's the energy of the first (lowest) level? If the electron drops down from the
second to the first level, the corresponding energy difference has to be released, for
example in the form of a photon. What would be the frequency of this photon?

5. The nucleus is a very small region where protons and neutrons are squeezed together
(or 'locked up') in a region of about 1 fm $= 10^{-15}$ m. Be very crude and pretend
that a nucleus really is a 'particle in a box' potential trapping the nucleons. If
a neutron ($m_n = 1.67 \cdot 10^{-27}$ kg) drops from the third to the first level, what
energy is released? To convince you that the 'particle in a box' approximation is
not completely ridiculous here: it is indeed possible for a nucleus to be in an excited
state - with one or more nucleons occupying a higher energy state (such states are
typically produced in nuclear decay processes). When decaying, typically a gamma
ray escapes, with an energy around 1 MeV ($= 10^6$ eV). Is this comparable to what
you found with your 'toy calculation'? (If not, you may have done something wrong
with units - check the footnote.)

6. In classical mechanics, the potential (and all other energy levels) can often be shifted
without changing the physics. To see how things work out here: solve the problem of
a particle in a box, but now with $V =$ (some constant) instead of $V = 0$ in the allowed
zone. What happens to the energy levels? Try to see what happens in general if you
send $H \to H +$ constant: do the eigenstates change, and their eigenvalues? What
about time evolution?



^Be careful with units here! Unless you are using the relation $E = hf$ (where you can express both
the energy and Planck's constant in eV units) you have to express Planck's constant in standard units: J·s
and not eV·s, since all other quantities (lengths, masses, ...) are expressed in standard units. Of course,
at the end of your calculation you may choose to re-express the energy you found in eV again.



Chapter 3 

The measurement 



In this chapter... 

We are now ready to introduce another central aspect of quantum mechanics: performing 
measurements on a particle's wave function. The tool we need to introduce is the notion 
of an inner product between functions. We then tell the story of the measurement in QM. In
the last section, we make all this more concrete with the study of the particle in a harmonic
potential.
3.1 Inner products and bra(c)kets

3.1.1 Inner product of functions 

In quantum mechanics, a very central role is played by complex functions. A simple but
important property is that the sum of two complex functions is again a complex function, and a
complex multiple of a complex function is a complex function as well. This means the
space of complex functions forms a (complex) vector space. Many vector spaces have a
notion of distance, angles and all that. All these useful concepts are usually derived from
one single notion: the inner product. Can you define an inner product for complex
functions as well? The answer is: yes, you can! It is usually denoted by $\langle f, g\rangle$ and is a
map from any two complex functions $f$ and $g$ to a complex number, as follows:

$$\langle f, g\rangle = \int_{-\infty}^{+\infty} \overline{f(x)}\, g(x)\, dx \tag{3.1}$$

Here the bar on $f$ denotes complex conjugation. Some people denote complex conjugation
with an asterisk, so they write $f(x)^*$ instead of $\overline{f(x)}$. The result of the above integral is
indeed a single complex number. You may worry that the above integral is not always
finite. Here is a relief: we will use inner products like the above specifically for wave
functions $f$ and $g$. Since wave functions fall off fast outside some region (they are wave
packets) they have finite inner products $\langle f, g\rangle$ amongst themselves. So in the case of our
interest, the above object will be well-defined. Roughly speaking, the inner product is
large if the wave packets $f$ and $g$ have a similar shape and location, otherwise it is small.
The inner product has a special property, called bilinearity:

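$$\langle f, a g_1 + b g_2\rangle = a\,\langle f, g_1\rangle + b\,\langle f, g_2\rangle \tag{3.2}$$

and

$$\langle a f_1 + b f_2, g\rangle = \bar{a}\,\langle f_1, g\rangle + \bar{b}\,\langle f_2, g\rangle. \tag{3.3}$$

This is sometimes put as follows: the inner product is linear in its second argument
and anti-linear in its first argument (mathematicians call this combination sesquilinearity).
Let's check that for example the second of these properties is true: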

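$$\langle a f_1 + b f_2, g\rangle = \int_{-\infty}^{+\infty} \overline{a f_1(x) + b f_2(x)}\; g(x)\, dx \tag{3.4}$$

$$= \int_{-\infty}^{+\infty} \left(\bar{a}\,\overline{f_1(x)} + \bar{b}\,\overline{f_2(x)}\right) g(x)\, dx \tag{3.5}$$

$$= \bar{a}\,\langle f_1, g\rangle + \bar{b}\,\langle f_2, g\rangle \tag{3.6}$$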



Here we first used the properties of complex conjugation (see Appendix 1 on complex 
numbers) and then the linearity of an integral. Another essential property of the inner 
product (which you can easily check) is the so-called conjugate symmetry: 



$$\langle f, g\rangle = \overline{\langle g, f\rangle} \tag{3.7}$$
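These properties are easy to probe numerically. The sketch below approximates the integral in (3.1) by a midpoint sum for two Gaussian wave packets; the packet shapes, the integration range and all names are assumptions made for this illustration only.

```python
import cmath
import math


def inner(f, g, lo=-10.0, hi=10.0, steps=4000):
    """Approximate <f, g> = integral of conj(f(x)) * g(x) dx by the midpoint rule."""
    dx = (hi - lo) / steps
    total = 0j
    for j in range(steps):
        x = lo + (j + 0.5) * dx
        total += f(x).conjugate() * g(x)
    return total * dx


def packet(center, wavenumber):
    """A Gaussian wave packet exp(-(x - center)^2 / 2) * exp(i k x)."""
    def psi(x):
        return cmath.exp(-0.5 * (x - center) ** 2 + 1j * wavenumber * x)
    return psi


f = packet(0.0, 1.0)
g = packet(0.5, 2.0)
a = 2.0 - 1.0j

# Conjugate symmetry (3.7): <f, g> equals the complex conjugate of <g, f>.
symmetry_gap = abs(inner(f, g) - inner(g, f).conjugate())

# Anti-linearity in the first argument (3.3): <a f, g> = conj(a) <f, g>.
antilinear_gap = abs(inner(lambda x: a * f(x), g) - a.conjugate() * inner(f, g))

# <f, f> is the integral of |f|^2; for this packet it equals sqrt(pi).
norm_sq = inner(f, f)
```

Both gaps vanish up to rounding errors, and `norm_sq` has no imaginary part: the inner product of a function with itself is a real number.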
Of course, we can also take the inner product of a function with itself:

$$\langle f, f\rangle = \int_{-\infty}^{+\infty} \overline{f(x)}\, f(x)\, dx = \int_{-\infty}^{+\infty} |f(x)|^2\, dx \tag{3.8}$$

The last expression, being the integral of a real and non-negative function $|f(x)|^2$, has to be
real and non-negative. Hence,

$$\langle f, f\rangle \geq 0. \tag{3.9}$$

This means we can always take the square root $\sqrt{\langle f, f\rangle}$. This gives a real number, and
says how "big" the (complex) function $f$ is. (Something like the total size of the complex
function $f$.) It is called the norm of the function $f$, and is denoted by

$$\|f\| = \sqrt{\langle f, f\rangle} \tag{3.10}$$

Using our fresh and shiny inner product tool, we can introduce yet another new concept
(besides the norm): the Hermitian conjugate of any operator $A$. It is denoted by $A^\dagger$
and defined as follows: $A^\dagger$ is the operator such that for every pair of functions $f$ and $g$,

$$\langle A^\dagger f, g\rangle = \langle f, A g\rangle. \tag{3.11}$$

One can show that

$$(A^\dagger)^\dagger = A \tag{3.12}$$

and

$$(aA + bB)^\dagger = \bar{a}\,A^\dagger + \bar{b}\,B^\dagger \tag{3.13}$$

for all operators $A$ and $B$, and all complex numbers $a$ and $b$. Side remark: as mentioned
in Appendix 1 of the first chapter, we denote complex conjugation either by a bar on
top (like in the above expression) or by an asterisk, at will. So both $\bar{c}$ and $c^*$ denote the
conjugate of $c$. Another property of Hermitian conjugation that one can show is

$$(AB)^\dagger = B^\dagger A^\dagger \tag{3.14}$$

So the order of operators switches upon taking the Hermitian conjugate. 
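The order reversal follows directly from the defining property (3.11). A short derivation, written out here as a worked step that uses nothing beyond (3.11), applied once to the product $AB$ and then to $A$ and $B$ separately:

```latex
\begin{align*}
\langle (AB)^\dagger f, g\rangle
  &= \langle f, A B g\rangle            && \text{(3.11) applied to the operator } AB \\
  &= \langle A^\dagger f, B g\rangle    && \text{(3.11) applied to } A \text{, acting on } Bg \\
  &= \langle B^\dagger A^\dagger f, g\rangle && \text{(3.11) applied to } B
\end{align*}
```

Since this holds for every pair of functions $f$ and $g$, the operators $(AB)^\dagger$ and $B^\dagger A^\dagger$ must be the same.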

3.1.2 Column- and row vectors 

Maybe you recognized several terms of the previous section. Indeed, you might be familiar
with the inner product of (complex) vectors from the context of linear algebra. Given two
$n$-dimensional complex vectors $a = (a_1, ..., a_n)$ and $b = (b_1, ..., b_n)$ it is defined as

$$\langle a, b\rangle = \sum_{j=1}^{n} \bar{a}_j\, b_j \tag{3.15}$$

You can check that this, just like the inner product of functions, is a bilinear operation.
The only difference is that here it maps two vectors, not two functions, to a complex
number. In linear algebra, one usually denotes vectors in a column version. For example
the vector $v$ with components $1$ and $i$ is written as:

$$v = \begin{pmatrix} 1 \\ i \end{pmatrix} \tag{3.16}$$

Next to that, vectors also have a row version, called their Hermitian conjugate and
denoted by $v^\dagger$. It is obtained by taking the complex conjugate of all components, and
then writing the numbers in a row. For the above vector, this gives:

$$v^\dagger = \begin{pmatrix} 1 & -i \end{pmatrix} \tag{3.17}$$

Now what is the point of making two different versions of the same object? Well, if
you write a column vector after a row vector, the ordinary matrix multiplication of the
two objects gives you precisely the inner product of the two vectors. So, by using the
right version of the vectors, people know where to take the inner product: it just happens
automatically if you perform the matrix multiplications. For example: if someone wants to
calculate the inner product of the above vector $v$ with the matrix product of a $2 \times 2$ matrix $A$
times a vector $w$, he should write down

$$\left\langle \begin{pmatrix} 1 \\ i \end{pmatrix},\; A\,w \right\rangle$$

with $A$ and $w$ written out in full. This looks horrible. However, with the row-column trick he can just write the above
expression as

$$v^\dagger A\, w = \begin{pmatrix} 1 & -i \end{pmatrix} A\, w \tag{3.18}$$

This looks more appealing, and is completely clear: just take the matrix product twice,
and the result you get is indeed the required inner product. It is important to remember
that the difference between row and column vectors is not fundamental: it is just a trick
to indicate: ah, if the two are next to each other, take the (inner or matrix) product.
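The row-column trick translates almost literally into code. In the sketch below the vector `v` is the one from (3.16), while the matrix `A` and the vector `w` are hypothetical values picked only to have something to multiply; the helper names are likewise invented for this example.

```python
def dagger(column):
    """Hermitian conjugate of a column vector: conjugate every entry, read it as a row."""
    return [z.conjugate() for z in column]


def mat_vec(matrix, column):
    """Ordinary matrix-times-column product, giving a new column."""
    return [sum(entry * z for entry, z in zip(row, column)) for row in matrix]


def row_col(row, column):
    """Row-times-column product, which yields a single number."""
    return sum(r * c for r, c in zip(row, column))


v = [1 + 0j, 1j]                     # the vector v of (3.16)
A = [[2 + 0j, 1j], [-1j, 3 + 0j]]    # hypothetical 2x2 matrix, for illustration only
w = [1 + 0j, 1 + 0j]                 # hypothetical vector, for illustration only

# Inner product of v with A w, computed as "row times matrix times column".
result = row_col(dagger(v), mat_vec(A, w))
print(result)   # (1-2j)
```

Nothing in the code ever calls a special "inner product" routine: conjugating $v$ into a row and multiplying everything out does the job, which is exactly the point of the trick.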

3.1.3 Bras and kets 

Let's try to repeat that convenient trick for the inner product of functions. Doing so, we 
will also upgrade our notation. Up to this point, we were a bit sloppy with the difference 
between a state of a particle and a function. For sure, the state of a particle is described 
by a complex function (the particle's wave function) but very strictly speaking it is not 
the same thing. A similar example: a temperature can be described by a number, but a 
temperature is more than 'just a number': it has a special meaning. To be more careful 
with this difference: if a particle has wave function $f$, we will say that it is in the state
$|f\rangle$. So using the vertical bar and a right bracket, we indicate that we are talking about
a state, not just any function. (In the same way that a °F, °C or K indicates that the
number in front is a temperature, not just any number.) Now, analogous to linear algebra,
for a state $|f\rangle$, we define its Hermitian conjugate, and denote it as $\langle f|$. In some sense,
it really is the same object (it specifies the state of a particle) but in a different version -
just like row versus column. Here the visual difference is not row or column, but the shape
of the brackets. Just like for ordinary vector spaces, the only use is the following: if you
write a conjugate $\langle g|$ next to $|f\rangle$, it is understood that you should take the inner product.
A state written in the form $|f\rangle$ is called a ket, and if you write it in the (conjugate) form
$\langle f|$, you call it a bra. The reason for these funny terms is as follows: if you write a 'bra'
and then a 'ket', you have to take the bracket (which is slang for 'inner product'). Yes,
this proves that physicists pull bad jokes.

$$\langle f|\,|g\rangle = \langle f|g\rangle = \langle f, g\rangle$$

(bra and then ket) = (short notation) = (bracket) 

This notation is a bit strange when you first meet it. Don't worry: its use will become 
clear later on. Hopefully you understood that there is not something deep behind it: it is 
just a convenient bookkeeping on where to take the inner product, that's all. 

When operator meets bra(c)ket

Now what about operators? When acting with an operator $A$ on a function $f$ you get a
new function $Af$. In the context of states: $|f\rangle$ describes a state with wave function $f$, and
$|Af\rangle$ describes a state with wave function $Af$. So this means we can see every operator
not only as a map between functions, but also as a map between states, as follows:

$$A : |f\rangle \mapsto A|f\rangle = |Af\rangle \tag{3.20}$$

In this last line, $A|f\rangle$ means: 'act with operator $A$ on state $|f\rangle$', and $|Af\rangle$ means again:
'the state with wave function $Af$'. This is a bit of a trivial remark - mostly a nuance in
notation and interpretation. Time for the story part!

3.2 The measurement

In the first chapter, we learned about the two slit experiment for electrons. Since the result 
of the experiment was very clearly pointing at interference (even if single particles went 
through) we concluded that each particle has to be a wave packet. This conclusion has 
been confirmed by many other experiments since - particles really are extended objects, 
described by a wave function. Yet, there is a subtlety that we have not touched upon so 
far. How does detection occur when one of the electrons arrives at the wall? In principle, 
the detecting wall measures the position of the incoming particle. But how does this work, 
given that the particle arriving is - in fact - an extended object? What outcome does a
measuring device give? Of course, if a measuring device could speak and gesture, it would
answer: "Well, the particle is spread out a bit, and is more or less here", waving its hand
at the location of the wave packet. But that is not what it is made to do, it is made
to give you an exact answer. It turns out that a measuring device will give you a very
precise answer for the position, one particular $x$. But what will it say? Which $x$ will it
give you? Well, the chance for a device to give you $x$ as the particle's position
is proportional to the squared magnitude $|\psi(x)|^2$ of the wave function there. So in some
sense, the measuring device does a good job: the position it gives you is usually a place
where the wave function is big, which is indeed an indication for where the wave function
is located. Since the function $|\psi(x)|^2$ is proportional to the chance to detect the particle's
position to be $x$, we call this function the probability density (the wave function $\psi(x)$ itself is often called the probability amplitude).





In this figure, several things are shown: a wave packet (consisting of a single lump
here) arrives at a detector wall. The detector wall consists of many individual detectors
(the holes) connected to a data collector. To indicate the shape of the wave function,
its real part $\mathrm{Re}\,\psi$ is shown. The total probability density $|\psi|^2 = (\mathrm{Re}\,\psi)^2 + (\mathrm{Im}\,\psi)^2$ is
also shown. The odds for a detector to give you $x_1$ as an answer are really small, as
$|\psi(x_1)|^2$ is very small there. An answer like $x_2$ is much more likely to be given.
Vaguely speaking: a measuring device will give a random, but reasonable answer.

Now if you like to experiment, the above explanation probably brings the following question to
your mind. What if, right after measuring a first time, you measure again? This could in
principle be done by making a 'transparent' detecting wall that does not stop the particle,
and placing a second detecting wall behind it. Is there again some statistical chance for the
possible outcomes? The answer is no! It turns out that if you do so, you will get the same
answer. So no statistical chances anymore, nothing like that: you are 100% sure to get
the same outcome as in the first measurement. If this confuses you, don't worry, many great
minds in physics of the last century were very puzzled by this fact, and the only answer
they could come up with is the following. Apparently, performing a measurement on
a wave function drastically alters it: it seems to collapse onto the state corresponding
to your measurement outcome. So if you measure its position, and get as answer $x_1$, the
wave packet simultaneously collapses into a wave function that is very sharply localized
around $x_1$, even if it wasn't so before.



[Figure: collapse of the wave function upon a measurement with a given result.]




This phenomenon is called the collapse of the wave function. There are some 
theories on how and why it happens, but despite all the debate, there is no consensus on 
which one is true. The great difficulty is of course talking about a particle (microscopic 
object) and a detector (macroscopic object) at the same time. To be consistent you would 
need to describe the detector in the quantum mechanical language as well. That is of 
course a very hard task, and can only be modeled poorly. Maybe the next generation of
physicists will be able to solve this problem - let's hope so. Here is the good news: even
though we don't know why it happens, it is very sure that it happens - to very high
accuracy - so this gives a partial relief.



The two slit experiment, revisited

To illustrate the above behaviour, let's go back to the two slit experiment. Say we use an
electron source and put it at a very low intensity, to make sure it shoots out the electrons
one by one. As a particle arrives at the slits, its wave function is still a nice blob. Then, as
it passes through the holes, the two parts of the wave function (passing through the left
and right slit) interfere with each other. The result is a "striped" wave function: only at
places of constructive interference is $|\psi|^2$ big. Now the striped wave function
arrives at the detecting wall. The chance for the detecting grid to tell you that the
particle arrived at position $x$ is large at the stripes only, since the probability density is
large there only. So very probably, it will give you a position - say some $x_1$ - lying within
one of the stripes. Now a second electron comes through. The shape of its wave
function after passing through the slits is identical to that of the first electron. However,
the result of the measurement is not necessarily the same. The measurement may yield
any outcome $x_2$, with probability proportional to $|\psi(x_2)|^2$. Obviously, after a lot of particles have gone
through the slits, the total detection rates will be nicely proportional to the probability
density $|\psi(x)|^2$ of the individual particles. So even though individual measurements
give very little information about the wave function, the total detection count reveals
this information very accurately. This way, the wave property of the individual particles
inevitably becomes apparent. So we conclude the first interpretation of the experiment
was correct, although the physics involved has a statistical subtlety. Got it?
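The way single random hits build up the striped pattern can be mimicked with a small simulation. The sketch below is a toy model only: the striped density (fringes under a Gaussian envelope), the detector range and the number of bins are assumptions invented for this illustration, not a description of a real two slit setup.

```python
import math
import random


def density(x):
    """Unnormalized striped |psi(x)|^2: interference fringes under a Gaussian envelope."""
    return math.cos(math.pi * x) ** 2 * math.exp(-x * x / 8.0)


def measure_position(rng, lo=-6.0, hi=6.0):
    """One position measurement: rejection sampling from density on [lo, hi]."""
    while True:
        x = rng.uniform(lo, hi)
        if rng.random() < density(x):   # density never exceeds 1
            return x


def detection_counts(n_particles, bins=120, lo=-6.0, hi=6.0, seed=1):
    """Send n_particles through one by one and count the hits per detector bin."""
    rng = random.Random(seed)
    counts = [0] * bins
    width = (hi - lo) / bins
    for _ in range(n_particles):
        x = measure_position(rng, lo, hi)
        counts[min(int((x - lo) / width), bins - 1)] += 1
    return counts
```

Each call to `measure_position` gives one unpredictable outcome, just like a single electron hitting the wall. Only the histogram returned by `detection_counts` for many particles shows the stripes: bins near a bright fringe (around $x = 0$ here) collect far more hits than bins near a dark fringe (around $x = 0.5$).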

Wave packets look like point particles 

Also, the above explanation should make clear why particles appear as points to us in 
many experiments. This is because the position measurement gives very sharp results, 
and simultaneously collapses the wave function onto a very localized one. In a more lyrical
phrasing: the measurement forces the wave function to act like a particle, or better: you 
may easily get the impression they are strict points indeed, although this is more an 
artifact of the measuring process. 

3.3 More precise

So far the story part, let's now make all this more concrete and precise. We said that the
chance to measure a particle with wave function $\psi$ to be at position $x$ is proportional to
$|\psi(x)|^2$. By this we actually mean: the chance to measure the particle to be in a very
small interval $[x, x + \Delta x]$ is given by $\Delta x \cdot |\psi(x)|^2$. For a larger interval the probability is
given by the integral of $|\psi(x)|^2$:

$$P([a,b]) = \int_a^b |\psi(x)|^2\, dx \tag{3.21}$$

where by $P([a,b])$ we mean the chance to detect the particle in the position interval $[a,b]$.
For this reason, people say that $|\psi(x)|^2$ is a probability density.

Consequence 1 

The total chance of getting a measurement result between $x = -\infty$ and $x = +\infty$ has to be
1 of course. This means we need

$$\int_{-\infty}^{+\infty} |\psi(x)|^2\, dx = 1 \tag{3.22}$$

In the terminology of the first section: the square of the norm of $\psi$ should be 1:

$$\langle\psi|\psi\rangle = 1 \tag{3.23}$$

If this is not the case for some wave function, you should multiply it by a number so that
it is. Concretely, you can do this by dividing $\psi$ by its norm:

$$\psi \to \frac{\psi}{\|\psi\|} = \frac{\psi}{\sqrt{\langle\psi|\psi\rangle}} \tag{3.24}$$
This is called normalization of the wave function. If a wave function (or state) is
not normalized, that isn't a deep problem, but in that case you can't immediately use
the probability law (3.21). You can now understand why we put the $A$'s involved for
the particle in a box to a specific value: we wanted the wave functions to be properly
normalized. More precisely: the squared norm of the particle-in-a-box wave functions can be shown
to be

$$\langle\psi_n, \psi_n\rangle = |A|^2\,\frac{L}{2} \tag{3.25}$$

So to achieve normalization, we have to put $A = \sqrt{2/L}$.
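For completeness, here is the integral behind (3.25), worked out for $\psi_n(x) = A\sin(n\pi x/L)$ inside the box (the wave function vanishes outside, so only $[0, L]$ contributes). It uses the standard identity $\sin^2 u = \tfrac{1}{2}(1 - \cos 2u)$:

```latex
\begin{align*}
\langle \psi_n, \psi_n \rangle
  &= |A|^2 \int_0^L \sin^2\!\left(\frac{n\pi x}{L}\right) dx
   = |A|^2 \int_0^L \frac{1}{2}\left(1 - \cos\frac{2n\pi x}{L}\right) dx \\
  &= |A|^2 \left[\frac{x}{2} - \frac{L}{4n\pi}\sin\frac{2n\pi x}{L}\right]_0^L
   = |A|^2\,\frac{L}{2}
\end{align*}
```

The sine term vanishes at both ends because $n$ is an integer, so the result is the same for every level $n$.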

Consequence 2 

Another thing. Let's try to calculate the expectation value of the position. So given a wave
function, what is the average position you would measure? This is the number you would
get if you would measure the position again and again and take the average - at least if
after each measurement you are able to put the particle back in its original state, undoing
the effect of the collapse of the wave function. More practically: it is the average of all the
consecutive position measurements of a large collection of particles, which you manage to
put in the same state. (Just like in the two slit experiment for example.) In words, this
quantity is given by summing for all intervals $[x, x + \Delta x]$ the chance of finding the particle
in that interval times the corresponding position $x$. So

$$\langle X\rangle_\psi = \int_{-\infty}^{+\infty} x \cdot |\psi(x)|^2\, dx \tag{3.26}$$

Here we have denoted the expectation value of the position for a wave function $\psi$ by $\langle X\rangle_\psi$.
Now the above can be written in a very compact way using the bra-ket notation:

$$\langle X\rangle_\psi = \langle\psi|X|\psi\rangle \tag{3.27}$$

Indeed, acting with the operator $X$ on the function $\psi(x)$ gives $x \cdot \psi(x)$, so the state $X|\psi\rangle$
($= |X\psi\rangle$) has wave function $x\,\psi(x)$ and

$$\langle\psi|X|\psi\rangle = \langle\psi|X\psi\rangle \tag{3.28}$$

$$= \int_{-\infty}^{+\infty} \overline{\psi(x)}\,\big(x \cdot \psi(x)\big)\, dx \tag{3.29}$$

$$= \int_{-\infty}^{+\infty} x\,|\psi(x)|^2\, dx \tag{3.30}$$

The object $\langle\psi|X|\psi\rangle$ is sometimes called the sandwich of the operator $X$ inside the state
$\psi$. The origin of this term is probably clear. In general, we call the object $\langle\psi|A|\phi\rangle$ the
sandwich of operator $A$ between states $\psi$ and $\phi$:
$\langle\psi|A|\phi\rangle$

Figure 3.1: Two sandwiches 




We can cryptically summarize the above paragraph by stating that the expectation 
value of the position for a state is given by the sandwich of the position operator X inside 
that state. If you understand this sentence, you have clearly gotten the above. We are 
now ready for another important example: a particle in a harmonic potential. There, we 
will be able to apply all the bra-ket machinery. 
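Before moving on, the probability law (3.21) and the position sandwich (3.26) can be tried out on the particle-in-a-box states of the previous chapter. This is a minimal numerical sketch using a midpoint sum for the integrals; the function names are assumptions made for this illustration.

```python
import math


def psi(n, x, length):
    """Normalized box eigenfunction sqrt(2/L) sin(n pi x / L)."""
    return math.sqrt(2.0 / length) * math.sin(n * math.pi * x / length)


def integrate(func, lo, hi, steps=2000):
    """Midpoint-rule approximation of the integral of func over [lo, hi]."""
    dx = (hi - lo) / steps
    return sum(func(lo + (j + 0.5) * dx) for j in range(steps)) * dx


def probability(n, a, b, length):
    """P([a, b]) from (3.21) for the n-th box state."""
    return integrate(lambda x: psi(n, x, length) ** 2, a, b)


def mean_position(n, length):
    """<X> from (3.26): integral of x |psi_n(x)|^2 over the box."""
    return integrate(lambda x: x * psi(n, x, length) ** 2, 0.0, length)
```

With these definitions `probability(n, 0.0, L, L)` returns 1 for a normalized state, and `mean_position(n, L)` returns $L/2$ for every $n$. On average the particle sits in the middle of the box, even though a single measurement can give any position where $|\psi_n(x)|^2$ is nonzero.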

3.3.1 Example: the harmonic oscillator 

Let's consider a particle with mass $m$ in a quadratic potential:

$$V(x) = \frac{1}{2}\,m\omega^2 x^2 \tag{3.31}$$

Classically, an object in such a potential swings perpetually around its equilibrium, with
(angular) frequency $\omega$. That is indeed what a marble would do in such a potential (ignoring friction).
But to see what an elementary particle in such a potential looks like, we need to use the
quantum mechanical description. The key is of course the SE (short for Schrödinger
Equation):

$$i\hbar\,\partial_t \psi(x,t) = H\psi(x,t) \tag{3.32}$$

For this potential, the Hamiltonian is given by

$$H = -\frac{\hbar^2}{2m} D_x^2 + V(X) = \frac{P^2}{2m} + \frac{m\omega^2}{2} X^2 \tag{3.33}$$

In the last step we have also defined the momentum operator

$$P = -i\hbar D_x. \tag{3.34}$$



General recipe 

The strategy is exactly the same as for the particle in a box. Assume we can solve the
time-independent SE. That is, assume we can find energy eigenstates $|\psi_n\rangle$:

$$H|\psi_n\rangle = E_n|\psi_n\rangle \tag{3.35}$$

Then their fully time-dependent counterparts are given by

$$\psi_n(x,t) = e^{-iE_n t/\hbar}\,\psi_n(x) \tag{3.36}$$

and these automatically satisfy the SE. So the problem is reduced to one real task: finding
all the wave functions $\psi_n(x)$ that solve the equation

$$\left(\frac{P^2}{2m} + \frac{m\omega^2}{2} X^2\right)\psi_n = E_n\,\psi_n \tag{3.37}$$

Figure 3.2: The (classical) movement of a macroscopic object in a harmonic potential is
shown by the oscillating point. For a microscopic particle things will look different: the
particle will take on some specific wave function $\psi(x)$ inside the potential.

Now there is bad news and good news. The bad news: unlike for the particle in a box, we
now have a nasty potential term in the Hamiltonian. This makes the above equation
hard to solve. The good news: there is a neat trick to solve the problem in an elegant
fashion. Here goes.

3.3.2 A neat trick 

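Let us define the following operators:

$$a = \sqrt{\frac{m\omega}{2\hbar}}\left(X + \frac{i}{m\omega}P\right) \tag{3.38}$$

$$a^\dagger = (a)^\dagger = \sqrt{\frac{m\omega}{2\hbar}}\left(X^\dagger - \frac{i}{m\omega}P^\dagger\right) \tag{3.39}$$

$$= \sqrt{\frac{m\omega}{2\hbar}}\left(X - \frac{i}{m\omega}P\right) \tag{3.40}$$

In the first step, we used rule (3.13) for Hermitian conjugation, namely

$(aA + bB)^\dagger = \bar{a}\,A^\dagger + \bar{b}\,B^\dagger$ for all numbers $a$, $b$ and operators $A$, $B$,

and in the second step, we used the fact that $X^\dagger = X$ and $P^\dagger = P$, a property you can
easily verify (see exercises) - but just buy it for now. If we now compose those operators
into a new operator $a^\dagger a$, we get:

$$a^\dagger a = \frac{m\omega}{2\hbar}\left(X - \frac{i}{m\omega}P\right)\left(X + \frac{i}{m\omega}P\right)$$

$$= \frac{m\omega}{2\hbar}\left(X^2 + \frac{i}{m\omega}(XP - PX) + \frac{P^2}{m^2\omega^2}\right)$$

$$= \frac{m\omega}{2\hbar}\left(X^2 + \frac{\hbar}{m\omega}(XD - DX) - \frac{\hbar^2}{m^2\omega^2}D^2\right) \tag{3.41}$$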

In the last line we did not write the index $x$ under the differential operator $D$ - since it
is clear we mean differentiation with respect to $x$. If $X$ and $D$ were ordinary numbers,
the object $XD - DX$ would be zero. But they are operators, and hence $XD$ is not
necessarily the same as $DX$! Let's think about this more carefully. The operator $XD$ takes
the derivative of a function, and then multiplies by the identity function $x$.

$$XD : f \mapsto XDf = x f' \tag{3.42}$$

But the operator $DX$ does something else: first it multiplies by $x$, then it takes the derivative
of that object. Using the product rule of differentiation, we get

$$DX : f \mapsto DXf = D(x \cdot f) = f + x f' \tag{3.43}$$

So $XD$ and $DX$ are really different operators. In general, for operators $A$ and $B$, the
object $AB - BA$ is called the commutator, and is denoted by $[A, B]$:

$$[A, B] = AB - BA \tag{3.44}$$

If the commutator of two operators is zero, then $AB = BA$, and we say the two operators
commute. In words: for commuting operators their order doesn't matter. If the commutator
of two operators is not zero, we say they don't commute, and then $AB \neq BA$.
Ok, back to the above. As $XD \neq DX$, we see that $D$ and $X$ don't commute. In fact, for
any $f$ we have

$$(XD - DX)f = x f' - (f + x f') = -f = -1 \cdot f \tag{3.45}$$

So here the commutator is given by: 

[X,D] = -l (3.46) 

where 1 is the unity operator we met before. Using this, we conclude that

$$ a^\dagger a = \frac{m\omega}{2\hbar} \left( X^2 - \frac{\hbar}{m\omega} - \frac{\hbar^2}{m^2\omega^2} D^2 \right) \tag{3.47} $$

So the Hamiltonian can be written as

$$ H = \hbar\omega \left( a^\dagger a + \frac{1}{2} \right) \tag{3.48} $$

Woah, what a coincidence! In terms of the a's, the Hamiltonian looks quite simple. With
a similar calculation to the above, you can check that

$$ [a, a^\dagger] = 1. \tag{3.49} $$

And from (3.48) and (3.49) one can show

$$ [H, a] = -\hbar\omega\, a \quad \text{and} \quad [H, a^\dagger] = \hbar\omega\, a^\dagger \tag{3.50} $$

giving us even more remarkable results. But what do we get from all these formulas?
Hold on for a moment, we will extract some important consequences after the next section.
First, we need one last sidestep.
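
For readers who like to see such operator identities at work, here is a minimal sketch that applies the operators to a concrete polynomial. It is only a spot check, not a proof. The coefficient-list representation, the helper names (`D`, `X`, `combine`, `a`, `a_dagger`) and the choice of units with hbar = m = omega = 1 are assumptions made for this illustration; in those units P = -iD, so that a = (X + D)/sqrt(2) and its conjugate is (X - D)/sqrt(2).

```python
import math

# Polynomials as coefficient lists: [c0, c1, c2, ...] stands for c0 + c1*x + c2*x^2 + ...

def D(f):
    """The operator D: differentiate with respect to x."""
    return [k * c for k, c in enumerate(f)][1:] or [0.0]

def X(f):
    """The operator X: multiply by x."""
    return [0.0] + list(f)

def combine(alpha, f, beta, g):
    """Return alpha*f + beta*g, padding the shorter list with zeros."""
    n = max(len(f), len(g))
    f = list(f) + [0.0] * (n - len(f))
    g = list(g) + [0.0] * (n - len(g))
    return [alpha * p + beta * q for p, q in zip(f, g)]

# Assumed units: hbar = m = omega = 1, so a = (X + D)/sqrt(2), a_dagger = (X - D)/sqrt(2).
S = 1 / math.sqrt(2)

def a(f):
    return combine(S, X(f), S, D(f))

def a_dagger(f):
    return combine(S, X(f), -S, D(f))

f = [5.0, -2.0, 0.0, 7.0]   # arbitrary test polynomial 5 - 2x + 7x^3

xd_minus_dx = combine(1, X(D(f)), -1, D(X(f)))                  # (XD - DX) f
a_commutator = combine(1, a(a_dagger(f)), -1, a_dagger(a(f)))   # [a, a_dagger] f

print(xd_minus_dx)    # the coefficients of -f, as (3.45) predicts
print(a_commutator)   # the coefficients of f (up to rounding), padded with zeros
```

The first printed list reproduces $-f$, in line with $[X, D] = -1$; the second reproduces $f$ itself (up to floating-point rounding and trailing zeros), in line with $[a, a^\dagger] = 1$.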

3.3.3 Sidestep: getting familiar with bras 

Suppose you have a state $|\psi\rangle$. Then $a|\psi\rangle$ is another state. Here is a question: what is the
bra state corresponding to this ket state $a|\psi\rangle$? Recall that the bra and ket are related by
Hermitian conjugation:

$$ \langle\psi| = (|\psi\rangle)^\dagger \tag{3.51} $$

So the bra version of $a|\psi\rangle$ is given by $(a|\psi\rangle)^\dagger$. From formula (3.14) we know that for two
operators $(AB)^\dagger = B^\dagger A^\dagger$. It turns out that for the product of a state and an operator,
the same property holds. We will not prove this, and use it as a given rule. In particular
this means that

$$ (a|\psi\rangle)^\dagger = (|\psi\rangle)^\dagger a^\dagger = \langle\psi| a^\dagger \tag{3.52} $$

So the object on the right hand side is the bra version of the ket $a|\psi\rangle$. Now, recall
that the norm squared of a state is obtained by putting the bra and the ket together:
$\| |\psi\rangle \|^2 = \langle\psi|\psi\rangle$. This means that the squared norm of the state $a|\psi\rangle$ is given by

$$ \| a|\psi\rangle \|^2 = \langle\psi| a^\dagger a |\psi\rangle \tag{3.53} $$

You can read the object on the right hand side in different ways. Either you see it as the
bra $\langle\psi| a^\dagger$ followed by the ket $a|\psi\rangle$, or you see it as the operator $a^\dagger a$ sandwiched between
$\langle\psi|$ and $|\psi\rangle$. Which viewpoint you take does not really matter - all really describe the same
object. In mathematical terms: building objects like in the above line is an associative
operation: it does not matter which objects you want to group or put brackets around, so

$$ (\langle\psi| a^\dagger)(a |\psi\rangle) = \langle\psi| (a^\dagger a) |\psi\rangle = \langle\psi| a^\dagger (a |\psi\rangle) = \dots \tag{3.54} $$

and so on. This might seem a bit suspicious at first, so a comparison with ordinary vectors
and matrices might be in place. Imagine you want to compute the norm squared of the vector $v$,
with

$$ v = \begin{pmatrix} 1 & 0 \\ i & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \tag{3.55} $$

You can do this in two ways. Of course, you can compute that $v = \begin{pmatrix} 1 \\ i \end{pmatrix}$, and hence the
norm squared is given by

$$ \|v\|^2 = v^\dagger v = \begin{pmatrix} 1 & -i \end{pmatrix} \begin{pmatrix} 1 \\ i \end{pmatrix} = 2 \tag{3.56} $$

However, you can also directly take the conjugate of (3.55), so that

$$ \|v\|^2 = v^\dagger v = \begin{pmatrix} 1 & 0 \end{pmatrix} \begin{pmatrix} 1 & -i \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 & 0 \\ i & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \tag{3.57} $$

If you compute the above line, you will again find 2 as an answer. But what we want to
stress is that it does not matter in what order you perform the matrix products. You can
start out left, or you can multiply the matrices in the middle first, whatever you like. So
here too, you can view the object in different (equivalent) ways, thanks to associativity.
The situation with (3.53) is precisely the same.
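
The two orders of multiplication can also be compared by machine. The sketch below is an illustration under stated assumptions: it takes the vector to be $v = Au$ with the matrix $A$ having rows $(1, 0)$ and $(i, 1)$ and with $u = (1, 0)^T$, and the helper names `matmul` and `dagger` are made up for this example.

```python
def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def dagger(A):
    """Hermitian conjugate: transpose and complex-conjugate every entry."""
    return [[A[i][j].conjugate() for i in range(len(A))] for j in range(len(A[0]))]

A = [[1, 0], [1j, 1]]   # assumed matrix of the example
u = [[1], [0]]          # assumed column vector it acts on

# Way 1: first compute v = A u, then the product of its conjugate with v.
v = matmul(A, u)
norm2_first = matmul(dagger(v), v)[0][0]

# Way 2: multiply the two matrices in the middle first, then sandwich between u-dagger and u.
middle = matmul(dagger(A), A)
norm2_second = matmul(matmul(dagger(u), middle), u)[0][0]

print(norm2_first, norm2_second)
```

Both numbers agree (they equal 2 for these inputs), which is associativity at work: grouping the factors differently does not change the result.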

3.3.4 The fling 

We can now start using all the above. Suppose there is a (normalized) state $|\psi_E\rangle$ that is an
energy eigenstate (= eigenstate of the Hamiltonian) with energy $E$. Then $a|\psi_E\rangle$ is another
state, and from the above we know that its norm squared is given by $\langle\psi_E| a^\dagger a |\psi_E\rangle$. Because the
norm of a state can never be negative, we must have

$$ 0 \leq \langle\psi_E| a^\dagger a |\psi_E\rangle \tag{3.58} $$

Expressing $a^\dagger a$ in terms of the Hamiltonian using (3.48), we get

$$ 0 \leq \langle\psi_E| \left( \frac{H}{\hbar\omega} - \frac{1}{2} \right) |\psi_E\rangle \tag{3.59} $$

$$ = \langle\psi_E| \left( \frac{E}{\hbar\omega} - \frac{1}{2} \right) |\psi_E\rangle \tag{3.60} $$

$$ = \left( \frac{E}{\hbar\omega} - \frac{1}{2} \right) \langle\psi_E|\psi_E\rangle \tag{3.61} $$

$$ = \frac{E}{\hbar\omega} - \frac{1}{2} \tag{3.62} $$

In the first step we have used that $|\psi_E\rangle$ is an eigenstate of $H$ with eigenvalue $E$ (by
assumption). Then, we dragged some things out of the bracket to the left: this is OK
because they are all just numbers. The last step then uses the normalization of $|\psi_E\rangle$. In
fact, we have also used the linearity property of the bracket implicitly in the first two
steps. If you now compare the first and last part, you see that $E \geq \hbar\omega/2$. So every energy
eigenstate has an energy of at least $\frac{1}{2}\hbar\omega$. That is already a nice result. But there is more. What if
we act with the Hamiltonian on this state $a|\psi_E\rangle$?

$$ H (a|\psi_E\rangle) = ([H, a] + aH) |\psi_E\rangle $$

$$ = (-\hbar\omega\, a + aE) |\psi_E\rangle $$

$$ = (E - \hbar\omega)(a|\psi_E\rangle) $$

The second step uses (3.50). In the last step, we dragged $E$ to the left. This is OK because
it is just a number, so there is no problem when you pass it to the other side of an operator
like $a$. Comparing the first and last part, you see the state $a|\psi_E\rangle$ is an eigenstate of the
Hamiltonian with energy $E - \hbar\omega$. So it is a state with one 'step' ($= \hbar\omega$) less energy than
$|\psi_E\rangle$. Similarly, you can show that

$$ H (a^\dagger|\psi_E\rangle) = (E + \hbar\omega)(a^\dagger|\psi_E\rangle) \tag{3.63} $$

meaning that $a^\dagger|\psi_E\rangle$ is an energy eigenstate as well, but with one 'step' more energy. This
means that (given one energy eigenstate) we can build many more energy eigenstates, with
lower or higher energy, by acting with the operators $a^\dagger$ and $a$. That's why $a^\dagger$ and $a$ are
called the raising and lowering operator, as they raise and lower the energy of a state
by one energy step $\hbar\omega$.

Now there might seem to be a contradiction here: by acting repeatedly with $a$, we can
get a series of states with lower and lower energy, but we just found that the minimum
energy is $E_{min} = \frac{1}{2}\hbar\omega$. This can only be reconciled if there is some state $|0\rangle$, for which

$$ a|0\rangle = 0 \tag{3.64} $$

so that the lowering process ends there. The right hand side here is the zero ket, a state
with wave function equal to zero, meaning there is no particle at all. Any subsequent
application of the lowering operator will just give the zero ket again, instead of additional
energy eigenstates. So the state $|0\rangle$ necessarily is the state with lowest energy:

$$ H|0\rangle = \frac{1}{2}\hbar\omega\, |0\rangle \tag{3.65} $$

Starting from this lowest energy state we can now build up a series of states (using $a^\dagger$),
which have higher and higher energy. (We may have to normalize them properly, but that
does not affect their energy.) We denote this tower of states by

$$ \{ |0\rangle, |1\rangle, |2\rangle, \dots, |n\rangle, \dots \} \tag{3.66} $$

Their energies are given by

$$ H|n\rangle = \hbar\omega \left( n + \frac{1}{2} \right) |n\rangle. \tag{3.67} $$

So, in a very elegant way we have solved the spectrum of the harmonic oscillator. We have
constructed a series of energy eigenstates, and know that these are the only ones around.
The time-dependent solutions are given by multiplying each state $|n\rangle$ by $e^{-iE_n t/\hbar}$, like in
(3.36). So this solves the problem of describing a particle in a harmonic potential.

Of course, you may have some difficulties imagining what these states look like. If so, 
you will find relief in the next section. 

Their shape 

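Maybe we should start with the lowest energy state. The lowest energy state satisfies
$a|0\rangle = 0$, so by definition of $a$ the wave function $\psi_{n=0}(x)$ corresponding to $|0\rangle$ satisfies

$$ x\, \psi_0(x) + \frac{\hbar}{m\omega} \frac{d\psi_0}{dx}(x) = 0 \tag{3.68} $$

which can be shown to be solved by the wave function

$$ \psi_0(x) \propto e^{-\frac{m\omega}{2\hbar} x^2} \tag{3.69} $$

or (after normalization)

$$ \psi_0(x) = \left( \frac{m\omega}{\pi\hbar} \right)^{1/4} e^{-\frac{m\omega}{2\hbar} x^2} \tag{3.70} $$

and its time evolution is given by

$$ \psi_0(x, t) = \left( \frac{m\omega}{\pi\hbar} \right)^{1/4} e^{-\frac{m\omega}{2\hbar} x^2} e^{-i\omega t/2} \tag{3.71} $$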
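
If you would rather check than trust the phrase 'can be shown', the following sketch tests the ground state numerically. It assumes the Gaussian form $\psi_0(x) = (m\omega/\pi\hbar)^{1/4} e^{-m\omega x^2/2\hbar}$ and units with hbar = m = omega = 1 (a convenience choice, not something the text prescribes); the function names are made up for the illustration. It evaluates the left-hand side of the condition $x\psi_0 + (\hbar/m\omega)\, d\psi_0/dx = 0$ with a finite difference, and integrates $|\psi_0|^2$ to check the normalization.

```python
import math

# Assumed units: hbar = m = omega = 1, only to keep the numbers simple.
HBAR = M = OMEGA = 1.0

def psi0(x):
    """Normalized Gaussian ground-state wave function."""
    norm = (M * OMEGA / (math.pi * HBAR)) ** 0.25
    return norm * math.exp(-M * OMEGA * x * x / (2 * HBAR))

def lowering_residual(x, h=1e-5):
    """x*psi0 + (hbar/(m*omega)) * dpsi0/dx, with the derivative from a central difference."""
    dpsi = (psi0(x + h) - psi0(x - h)) / (2 * h)
    return x * psi0(x) + HBAR / (M * OMEGA) * dpsi

def norm_squared(a=-10.0, b=10.0, steps=20000):
    """Integral of |psi0|^2 over [a, b] using the midpoint rule."""
    dx = (b - a) / steps
    return sum(psi0(a + (k + 0.5) * dx) ** 2 for k in range(steps)) * dx

residuals = [abs(lowering_residual(x)) for x in (-2.0, -0.5, 0.0, 1.0, 3.0)]
print(max(residuals))    # close to zero: the lowering condition holds at the sample points
print(norm_squared())    # close to one: the wave function is normalized
```

The residual is not exactly zero because the derivative is approximated numerically; it shrinks further when the step `h` is chosen sensibly.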



In a similar fashion, one can explicitly find the wave function of the other energy eigenstates
$|2\rangle$, $|3\rangle$, etcetera. This is a bit tedious, and we will just show the result pictorially:

[Figure: the wave functions $\psi_n(x)$ of the first few energy eigenstates.]

What about measuring the position of a particle in one of these states? As you know,
the chance to get an outcome $x$ from a measuring device is proportional to $|\psi(x)|^2$. From
the figure you see $\psi_n(x)$ becomes more and more spread out with increasing $n$. So for
higher energy states, it becomes more and more likely to measure the particle to be far
away from the origin. Intuitively, the particle spreads more throughout the potential as it
gains energy, somewhat like a classical oscillator gaining amplitude with increasing energy.
What about the expectation value of the position? This is given by

$$ \langle X \rangle_{\psi_n} = \int_{-\infty}^{\infty} x\, |\psi_n(x)|^2 \, dx \tag{3.72} $$

As you see from the figure, the wave functions $\psi_n(x)$ are all symmetric or anti-symmetric:
$\psi_n(x) = \pm\psi_n(-x)$. Hence the squared amplitude has $|\psi_n(x)|^2 = |\psi_n(-x)|^2$. So the
integrand $x|\psi_n(x)|^2$ above is an anti-symmetric function: the values at $x$ and $-x$ are exactly
opposite. Such functions have integral zero, since the contributions from the positive and
negative integration domain exactly cancel. So

$$ \langle X \rangle_{\psi_n} = 0 \quad \forall n \tag{3.73} $$

So we find that the expectation value of the position measurement is zero, for all states.
This is not so surprising: after all the system is left-right symmetric. Of course, this does
not mean you will always get 0 as an outcome of the measurement: the above is just a
'weighted average' - which happens to be zero.
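
The cancellation argument can be illustrated with a small numerical integration. The sketch below is only an illustration: it works in assumed units with hbar = m = omega = 1, uses the Gaussian ground state as the symmetric example, and compares it with a hypothetical shifted Gaussian, which is not one of the energy eigenstates and serves purely as a contrast.

```python
import math

def gaussian(center):
    """Normalized Gaussian wave function centred at `center` (assumed units hbar = m = omega = 1)."""
    return lambda x: math.pi ** -0.25 * math.exp(-(x - center) ** 2 / 2)

def expectation_x(psi, a=-12.0, b=12.0, steps=24000):
    """Midpoint-rule estimate of the integral of x * |psi(x)|^2 over [a, b]."""
    dx = (b - a) / steps
    total = 0.0
    for k in range(steps):
        x = a + (k + 0.5) * dx
        total += x * psi(x) ** 2
    return total * dx

symmetric = expectation_x(gaussian(0.0))   # ground state: symmetric around the origin
shifted = expectation_x(gaussian(1.5))     # hypothetical state centred at x = 1.5

print(symmetric, shifted)
```

For the symmetric state the contributions from $x$ and $-x$ cancel and the average vanishes; for the shifted state the average sits at the centre of the bump.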

3.3.5 Conclusion 

Summarising, there are several special aspects to this system. First, it is remarkable that
the lowest energy is not zero. Classically, a particle at rest in the potential has zero
kinetic and potential energy, and hence zero total energy. For a wave function, this turns
out not to be the case: the lowest energy it can reach is $\frac{1}{2}\hbar\omega$. Some people express this (in a
somewhat obscure, but poetic phrasing) as: "Quantum mechanically, a harmonic oscillator
can never be at rest." By this they just mean: "A particle in a harmonic potential cannot
have zero energy." Besides that, we also see some similarities with the particle in a
box. Again, only certain energy levels are admitted. There is no state with energy $3.1\,\hbar\omega$
for example. This means the transitions between different energy levels of the particle can
release only specific amounts of energy: only multiples of $\hbar\omega$. In the classical theory of
(point-like) electrons and electromagnetism, an electron that is harmonically oscillating
at frequency $\omega$ emits radiation with frequency $\omega$, thereby lowering its energy. However,
this loss of energy can occur in arbitrarily small steps. According to the correct (quantum
mechanical) description, the situation is different. Here the particle can emit energy too,
but it is clear that this radiation should always have energy $n\hbar\omega$ for some natural number
$n$. Do you smell $n$ photons of frequency $\omega$ coming out of the particle? Nice, huh?

You may now understand better where the name 'Quantum Mechanics' comes from.
In the situation here, for the particle in a box, and in many more examples, we will see
that discrete amounts of energy are typically involved. We meet discrete energy levels,
a photon is a discrete package of energy, and so on. The word 'quantum' (derived from 
Latin) precisely means 'discrete amount' or 'small package'. So Quantum Mechanics is 
in essence the description of systems involving discrete amounts of energy: the world of 
particles.

Exercises

1. Prove (3.12), (3.13) and (3.14). (Hint: you'll need the conjugate symmetry and
bilinearity.)

2. Compute $[X^2, D^2]$.

3. Prove that if an energy eigenstate is normalized, then it will still be normalized at
any later moment in time. (Just write down its time evolution and take the norm.)

4. Verify $[H, a] = -\hbar\omega\, a$ and $[H, a^\dagger] = \hbar\omega\, a^\dagger$.

5. Compute $\psi_1(x)$ from the expression for $\psi_0(x)$. (You don't need to normalize it.)
Try to draw this function. Does it match the graph shown in section 3.3.4? What
is the time-evolution of this state?

6. Imagine you can create an electric field of the form $E = -\kappa x$ with $\kappa = 100$ V/m$^2$.
(Check that the units are right.) Put a proton ($m_p = 1.67 \cdot 10^{-27}$ kg, $q_p = +1.6 \cdot 10^{-19}$
C) in that field - it will stay trapped around $x = 0$. What frequency should a photon
have to be able to kick up the proton from the lowest to the next-to-lowest energy
level?

7. Prove that the operators $X$ and $P$ are Hermitian. (For the second: think of partial
integration. Why can you drop the boundary terms for a wave packet?)



Chapter 4 

Observables 

In this chapter... 

In the previous chapter, we learned how a position measurement works out in quantum
mechanics. In this chapter, we will extend this further, and explain how general
measurements (such as the energy or velocity of a particle) work. Doing so, we will learn
more about the space of states of a particle. We first introduce the concepts 'Hermitian
operator' and 'state basis'. Then follows the story part (which generalizes the previous
chapter to other measurements) and we conclude with a simple example.

4.1 Hermitian operators and the Hilbert space

4.1.1 Hermitian operators 

You know by now that to each operator $A$, we can associate another one, its Hermitian
conjugate $A^\dagger$. It turns out that for some special operators, the following holds:

$$ A = A^\dagger \tag{4.1} $$

By this we just mean that for every function $f$, the outputs $Af$ and $A^\dagger f$ are equal.
Such operators are called Hermitian operators. As we will learn in this chapter, such
operators are very important in quantum mechanics. One special property they exhibit
is that they only have real eigenvalues. Indeed, suppose a Hermitian operator $L$ has an
eigenvalue $\lambda$. Then there exists a function $f$ such that $Lf = \lambda f$, and

$$ \bar{\lambda} \langle f, f \rangle = \langle \lambda f, f \rangle = \langle Lf, f \rangle = \langle f, Lf \rangle = \lambda \langle f, f \rangle. \tag{4.2} $$

In these steps we used bilinearity of the inner product, Hermiticity of $L$, and the fact that
$f$ is an eigenfunction. Since $\langle f, f \rangle$ is nonzero (true for every nonzero $f$), comparing the left and right
side of the above equation gives $\bar{\lambda} = \lambda$, so $\lambda$ must be real. This shows that every eigenvalue
$\lambda$ of a Hermitian operator indeed has to be real. Besides that, an important property of
Hermitian operators is that their eigenfunctions are orthogonal. By this we mean
that eigenfunctions with different eigenvalues have an inner product equal to zero. Indeed,
imagine functions $f_1$ and $f_2$ with eigenvalues $\lambda_1$ and $\lambda_2$ ($\neq \lambda_1$) under a Hermitian operator
$L$. Then

$$ \lambda_2 \langle f_1, f_2 \rangle = \langle f_1, L f_2 \rangle = \langle L f_1, f_2 \rangle = \bar{\lambda}_1 \langle f_1, f_2 \rangle = \lambda_1 \langle f_1, f_2 \rangle \tag{4.3} $$

In the last step we used the fact that an eigenvalue $\lambda_1$ has to be real. Once again combining
the first and last side of the equation, we now get

$$ (\lambda_2 - \lambda_1) \langle f_1, f_2 \rangle = 0 \tag{4.4} $$

which (as $\lambda_2 \neq \lambda_1$) means that $\langle f_1, f_2 \rangle = 0$, so indeed $f_1$ and $f_2$ are orthogonal.
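
Both properties are easy to watch in the simplest finite-dimensional setting. The sketch below is an illustration with an arbitrarily chosen 2x2 Hermitian matrix standing in for the operator $L$; the matrix entries and the helper names are assumptions of this example. The eigenvalues are computed with a complex square root on purpose, so nothing in the code forces them to be real.

```python
import cmath

# An arbitrary 2x2 Hermitian matrix (illustrative choice): real diagonal,
# off-diagonal entries that are each other's complex conjugates.
a, d = 2.0, -1.0
b = 1.0 + 2.0j
L = [[a, b], [b.conjugate(), d]]

# Eigenvalues from the characteristic polynomial  lam^2 - (a + d) lam + (a d - |b|^2) = 0.
trace, det = a + d, a * d - abs(b) ** 2
disc = cmath.sqrt(trace ** 2 - 4 * det)
lam1, lam2 = (trace + disc) / 2, (trace - disc) / 2

def eigenvector(lam):
    """A (non-normalized) solution of (L - lam) f = 0; valid because b is nonzero."""
    return [b, lam - a]

def inner(f, g):
    """Inner product, conjugate-linear in the first argument."""
    return sum(fi.conjugate() * gi for fi, gi in zip(f, g))

f1, f2 = eigenvector(lam1), eigenvector(lam2)
print(lam1, lam2)      # imaginary parts vanish: the eigenvalues are real
print(inner(f1, f2))   # close to zero: the eigenvectors are orthogonal
```

Changing one off-diagonal entry so that the matrix is no longer Hermitian generally destroys both properties, which is a quick way to see that Hermiticity is really what is being used.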

4.1.2 A space of states 

In the previous chapters, we solved two systems: a particle in a box and a particle in 
a harmonic potential. In either case, we found an infinite tower of states. Each state 
describes a specific wave function the particle can be in. You may wonder what the 
meaning is of sums of these wave functions - or in general: any linear combination. These 
are complex functions as well. And they satisfy the same good properties: they are smooth 
and have a decent fall-off (so they can be normalized). On these grounds, we could be
tempted to include these combinations as valid wave functions as well. Doing so, the space
of states becomes a complex vector space! That space is called the Hilbert space. So if
$\psi_1(x)$ and $\psi_2(x)$ are two wave functions (i.e.: elements of the Hilbert space) then

$$ \chi(x) = a\,\psi_1(x) + b\,\psi_2(x) \tag{4.5} $$

is a wave function as well, for any value of the complex numbers $a$ and $b$. You may still
need to normalize this wave function $\chi(x)$, but that is just a matter of multiplying by a
constant - not really a deep issue. In the language of kets:

$$ |\chi\rangle = a|\psi_1\rangle + b|\psi_2\rangle \tag{4.6} $$

gives a new state $|\chi\rangle$ built from the states $|\psi_1\rangle$ and $|\psi_2\rangle$. So if you take the sum of two
kets, it is understood that you mean the state whose wave function is the sum of the wave
functions:

$$ a|\psi_1\rangle + b|\psi_2\rangle = |a\psi_1 + b\psi_2\rangle \tag{4.7} $$

Of course, we can also take linear combinations of more than two elements. This way, all
linear combinations of the original set of states become elements of the Hilbert space. So
the tower of states we found for the harmonic oscillator (and likewise for the particle in a box) forms a
basis of the Hilbert space.

From a 'the more the merrier' point of view, the above is not really an unpleasant fact. 
However, something might bother you. We were very proud that we could explicitly write 
down the time-evolution of the states we found. But now we have a whole new zoo of 
states that we have to consider. How do all these evolve in time? This turns out to work 
out nicely. Let us look at the Schrödinger equation once again:

$$ i\hbar\, \partial_t \psi(x, t) = H \psi(x, t) \tag{4.8} $$

This differential equation is of a special kind: it is a linear differential equation. That
means that if you have two solutions satisfying it, every linear combination satisfies it too.
Let us make this more concrete. Say at $t = 0$ we consider two wave functions $\psi_1(x, 0)$
and $\psi_2(x, 0)$ that are energy eigenfunctions with values $E_1$ and $E_2$. We know that their
time-evolution (dictated by the SE) is just given by

$$ \psi_1(x, t) = \psi_1(x, 0)\, e^{-iE_1 t/\hbar}, \qquad \psi_2(x, t) = \psi_2(x, 0)\, e^{-iE_2 t/\hbar} \tag{4.9} $$

But what if at $t = 0$ we start out with the state $\chi(x, 0) = a\,\psi_1(x, 0) + b\,\psi_2(x, 0)$? How does
this evolve in time: what is $\chi(x, t)$? A priori this could be anything, but because of
linearity, the function

$$ \chi(x, t) = a\,\psi_1(x, t) + b\,\psi_2(x, t) = a\,\psi_1(x, 0)\, e^{-iE_1 t/\hbar} + b\,\psi_2(x, 0)\, e^{-iE_2 t/\hbar} \tag{4.10} $$

automatically satisfies the SE since $\psi_1(x, t)$ and $\psi_2(x, t)$ do. In conclusion: the time
evolution of a linear combination is just the linear combination of the time evolutions.
This means we do not need to do any extra work when describing the entire Hilbert space.
Time for the story...

4.2 Observables

In the previous chapter, we learned about the probability amplitude and how measuring
a particle's position deforms its wave function. Of course, besides position there are many
other properties of a particle that you may measure: energy, velocity, angular momentum,
or any combination like 'position times momentum' or 'position squared' and so on. Every
quantity of a particle that you can measure (or 'observe') is called an observable. It
turns out that every observable is associated to a unique operator. So there is a strict
correspondence between the numerical quantities we can observe in experiments with
particles, and operators acting on the space of wave functions (Hilbert space). Here is a
list of pairs we have met so far:

energy ($E$)    <->  Hamiltonian operator $H$
position ($x$)  <->  position operator $X$
momentum ($p$)  <->  momentum operator $P$ ($= -i\hbar D_x$)
                                                        (4.11)

Now comes a very nice surprise: it turns out that all observables - including the above -
are Hermitian. That is a very nice property: we just learned that the eigenfunctions of a
Hermitian operator are orthogonal. If you normalize those states, you get an orthonormal
basis.

The value of such an orthonormal basis cannot be stressed enough. To see why, think
back to an ordinary complex vector space for a little moment. If you have a basis $\{e_1, \dots, e_n\}$
then every element $v$ in the vector space can be written as

$$ v = v_1 e_1 + \dots + v_n e_n \tag{4.12} $$

for precisely one combination $\{v_1, \dots, v_n\}$ of complex numbers. (We don't write any vector
arrows, out of laziness.) So the advantage of a basis is clear: it allows specifying every
element of the vector space uniquely, by a set of numbers. The problem is that (for a
given vector) it is not immediately clear how to find the above decomposition. But if a
vector space is endowed with an inner product, things are different. One can try to find a
special basis, namely an orthonormal one. By orthonormal we mean that

$$ \langle e_i, e_j \rangle = \delta_{ij} \tag{4.13} $$

Here $\delta_{ij}$ is the Kronecker delta, which equals one if $i = j$ and equals zero if $i \neq j$.
For such a basis, and any vector $v$ in the vector space,

$$ \langle e_1, v \rangle = \langle e_1, v_1 e_1 + \dots + v_n e_n \rangle = v_1 \cdot 1 + 0 + \dots + 0 = v_1, \tag{4.14} $$

where we used linearity of the inner product and orthonormality of the basis. So the
number $v_1$ (and analogously any of the $v_i$) can be obtained just by taking the inner
product of $v$ with $e_1$ (or any of the $e_i$). This is a very strong fact: not only is every
vector uniquely specified by a set of numbers, this decomposition can be obtained easily
by taking the inner product. The quantities $v_i$ are called the projections of $v$ on the $e_i$. They
tell you how much $v$ is directed along the vector $e_i$. An example in two dimensions, the
vector $v = 2e_1 + e_2$ and its projections:

[Figure: the vector $v = 2e_1 + e_2$ drawn together with its projections on $e_1$ and $e_2$.]
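
The same recipe works for complex vectors, and a few lines of code make it concrete. In the sketch below the orthonormal basis of the two-dimensional complex space, the vector `v` and the helper `inner` are all illustrative choices, not taken from the text. The coefficients are obtained purely by taking inner products, and the vector is then rebuilt from them.

```python
import math

def inner(f, g):
    """Inner product on C^n, conjugate-linear in the first argument."""
    return sum(complex(fi).conjugate() * gi for fi, gi in zip(f, g))

s = 1 / math.sqrt(2)
e1 = [s, s * 1j]        # an orthonormal basis of C^2 (illustrative choice)
e2 = [s, -s * 1j]

v = [3 + 1j, 2 - 4j]    # an arbitrary vector to decompose

# Projections: the i-th coefficient is the inner product of e_i with v.
v1, v2 = inner(e1, v), inner(e2, v)

# Rebuild v as v1*e1 + v2*e2.
rebuilt = [v1 * p + v2 * q for p, q in zip(e1, e2)]

print(v1, v2)
print(rebuilt)          # reproduces v up to rounding
```

No linear system had to be solved to find `v1` and `v2`; that is exactly the advantage of an orthonormal basis.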



Things are very similar for a Hilbert space. There too, we have a basis. There too, 
we want to be able to write every element in terms of this basis. But in the context of 
the Hilbert space, the advantage of an orthonormal basis is even more drastic. In that 
case the set of basis elements is typically infinite. If someone gives you a function, and 
asks to decompose it as a combination of basis elements, you will have a very hard time. 
For an orthonormal basis, things are not that harsh: you can obtain that decomposition 
in a straightforward manner. Indeed, the projections on the basis elements are just given 
by the inner product, which requires just performing a definite integral. Later on, we will 
perform some concrete decompositions. What you should remember from this, is that the 
set of eigenstates of an observable provides a very good basis of the Hilbert space: an 
orthonormal one. So every observable is a bit like a fairy: it can help make your deepest
wish come true.

Think back to, say, the particle in a box for a moment. There we found eigenstates
of the Hamiltonian. You now know that this operator is Hermitian, so you conclude that
the different energy states we found were orthonormal. Indeed, if you like performing
integrals, feel very welcome to try to compute the inner product between different energy
states. It will always give you zero.

Now imagine that - for a given problem - you have found the eigenstates/eigenfunctions
of some observable $O$. What is the physical meaning of these eigenstates? Say, for example, that you
have an eigenstate under the operator $O$ with eigenvalue $\lambda$. It is natural to interpret this
as a state for which the observable corresponding to $O$ has value $\lambda$. Actually,
we used a specific case of this interpretation already: we interpreted eigenstates of the
Hamiltonian with eigenvalue $E$ as states with energy $E$. Similarly, an eigenstate of the
momentum operator $P$ with eigenvalue $p$ will just mean a particle with momentum $p$,
etcetera.

4.2.1 Non-eigenstates

Also the negation of the above interpretation makes sense. A state that is not an eigenstate
of $O$ is a particle for which the measurable quantity corresponding to $O$ does not have
a specific value. Let's be more concrete. Suppose the eigenstates of an observable $O$ are
given by states $|\psi_n\rangle$, with corresponding eigenvalues $\lambda_n$. If we have also normalized the
$|\psi_n\rangle$, then they form an orthonormal basis:

$$ \langle\psi_n|\psi_m\rangle = \delta_{mn}. \tag{4.15} $$

Now a state of the form

$$ |\psi\rangle = \sum_n c_n |\psi_n\rangle \tag{4.16} $$

(with the $c_n$'s complex numbers) is clearly not an eigenstate of $O$. (Unless only one of the
$c_i$ is nonzero.) If you measure the observable quantity corresponding to $O$ of a particle in
the above state, your apparatus will give you any of the values $\lambda_i$ at random, but the
probability of getting a specific $\lambda_i$ as an answer is given by

$$ P_\psi(\lambda_i) = \text{probability to measure } \lambda_i = \frac{|c_i|^2}{\sum_n |c_n|^2} \tag{4.17} $$

Just as with measuring the position, the measurement drastically changes the state. If your
measurement results in some $\lambda_i$, the state will collapse onto the corresponding eigenstate,
so

$$ \text{if measurement gives outcome } \lambda_i \text{ then } |\psi\rangle \to |\psi_i\rangle \tag{4.18} $$

Let us look at the denominator in (4.17) for a moment. This object is just the norm squared of
the state $|\psi\rangle$:

$$ \langle\psi|\psi\rangle = \left( \sum_n \bar{c}_n \langle\psi_n| \right) \left( \sum_m c_m |\psi_m\rangle \right) \tag{4.19} $$

$$ = \sum_n \sum_m \bar{c}_n c_m \delta_{nm} \tag{4.20} $$

$$ = \sum_i |c_i|^2 \tag{4.21} $$

From this, it follows that expression (4.17) does not change if one replaces the state $|\psi\rangle$
by any complex multiple $a|\psi\rangle$ of itself: both the numerator and denominator will scale up
with a factor $|a|^2$. This implies that the seemingly different states $|\psi\rangle$ and $a|\psi\rangle$ give the
same measurement probabilities, so they represent the same physical object. Stated
more roughly: they are just different names for the same thing. This is of course related to
the issue of normalization. States $|\psi\rangle$ and $a|\psi\rangle$ may look different, but after normalization
they are the same object.^1 If we demand $|\psi\rangle$ to be normalized from the beginning, (4.17)
simplifies a bit. In that case the denominator on the right equals one, so that

$$ P_\psi(\lambda_i) = |c_i|^2 \quad \text{(for normalized } |\psi\rangle \text{)} \tag{4.22} $$

Note that the total chance of measuring any of the $\lambda_i$ is one, as should be. The expectation
value of the observable $O$ is then given by

$$ \langle O \rangle_\psi = \sum_i \lambda_i P_\psi(\lambda_i) \tag{4.23} $$

You can check that the above can be rewritten in a compact fashion using the sandwich
of $O$:

$$ \langle O \rangle_\psi = \frac{\langle\psi|O|\psi\rangle}{\langle\psi|\psi\rangle} \tag{4.24} $$

which is very similar to the expression for the position expectation value. For a normalized
state, the denominator is again one. So this is how the measurement works for observables
other than the position. Time for an example!
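
As a quick numerical warm-up, the measurement rules can be written out in a few lines. In this sketch the eigenvalues and the coefficients are made-up numbers for a hypothetical observable with three eigenstates, and the function names are illustrative. It computes the probabilities $|c_i|^2 / \sum_n |c_n|^2$, the expectation value as the probability-weighted average of the eigenvalues, and checks that multiplying the state by a complex number changes nothing.

```python
def probabilities(c):
    """Probability of each outcome: |c_i|^2 divided by the sum of all |c_n|^2."""
    total = sum(abs(cn) ** 2 for cn in c)
    return [abs(ci) ** 2 / total for ci in c]

def expectation(c, eigenvalues):
    """Average of the eigenvalues, weighted with the probabilities above."""
    return sum(p * lam for p, lam in zip(probabilities(c), eigenvalues))

eigenvalues = [1.0, 2.0, 5.0]          # hypothetical eigenvalues of some observable O
c = [1 + 1j, 2, -1j]                   # hypothetical, non-normalized coefficients

p = probabilities(c)
scaled = [(3 - 4j) * ci for ci in c]   # the same state multiplied by a complex number

print(p)                               # the three probabilities; they add up to one
print(expectation(c, eigenvalues))
print(probabilities(scaled))           # unchanged by the overall factor
```

Dividing by the total in `probabilities` plays the role of the denominator in (4.17); for a normalized state it equals one and could be dropped.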

4.3 Working with the Hilbert space

4.3.1 Example 

To illustrate all the above, let's go back to, say, the harmonic oscillator. Recall that we
found

$$ H|n\rangle = \hbar\omega \left( n + \frac{1}{2} \right) |n\rangle \tag{4.25} $$

or by using $H = \hbar\omega \left( a^\dagger a + \frac{1}{2} \right)$,

$$ a^\dagger a |n\rangle = n |n\rangle \tag{4.26} $$

For this reason, the object $a^\dagger a$ is called the number operator - it just gives you the
state number. (More precisely: the n-th energy eigenstate is an eigenstate of the number
operator as well, with eigenvalue $n$.) If we require that $|n\rangle$ is a normalized state, we have

$$ \langle n| a^\dagger a |n\rangle = n \langle n|n\rangle = n \tag{4.27} $$

We also found that $a|n\rangle$ is a state with one quantum less energy. So necessarily $a|n\rangle \propto |n-1\rangle$.
There could be a proportionality constant involved here: $|n-1\rangle$ is normalized
but $a|n\rangle$ may not be so. Indeed, the above equation precisely shows that $a|n\rangle$ has norm
squared equal to $n$. So we conclude

$$ a|n\rangle = \sqrt{n}\, |n-1\rangle \tag{4.28} $$



^1 To be overly precise: even if one asks $|\psi\rangle$ to be normalized, there is still some freedom of renaming:
all states $a|\psi\rangle$ with $a = e^{i\theta}$ (a complex number of unit modulus) are still normalized, and all represent the
very same physical state.

Similarly, acting with $a^\dagger$ on this equation and renaming $n \to n + 1$ shows that

$$ a^\dagger |n\rangle = \sqrt{n+1}\, |n+1\rangle. \tag{4.29} $$

Actually, there is a nice way to make all these things more concrete. Let us denote the
state $|n\rangle$ by the n-th basis vector:

$$ |n\rangle \leftrightarrow \begin{pmatrix} 0 \\ \vdots \\ 1 \\ \vdots \end{pmatrix} \tag{4.30} $$

where the 1 is on the n-th position. Obviously, to include all states we need to take the
vector on the right hand side to be infinitely long. This is a bit funny, but the notation
works fine. A general state in the Hilbert space is then denoted in vector form as follows:

$$ \sum_n c_n |n\rangle \leftrightarrow \begin{pmatrix} c_0 \\ c_1 \\ c_2 \\ \vdots \end{pmatrix} \tag{4.31} $$

We start labeling by 0 because the state labels start from zero. In a similar way, we
can denote the bra version of states by a row vector. For the state $|\psi\rangle$ above, we have

$$ \langle\psi| \leftrightarrow \begin{pmatrix} \bar{c}_0 & \bar{c}_1 & \bar{c}_2 & \dots \end{pmatrix} \tag{4.32} $$

In this way, the norm squared of a state is just the matrix product:

$$ \langle\psi|\psi\rangle = \begin{pmatrix} \bar{c}_0 & \bar{c}_1 & \bar{c}_2 & \dots \end{pmatrix} \begin{pmatrix} c_0 \\ c_1 \\ c_2 \\ \vdots \end{pmatrix} = \sum_n |c_n|^2 \tag{4.33} $$

With this notation, we can represent operators as matrices acting on the vectors. For
example, looking at (4.28) it follows that the lowering operator is given by



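$$ a \leftrightarrow \begin{pmatrix} 0 & \sqrt{1} & 0 & 0 & 0 & \dots \\ 0 & 0 & \sqrt{2} & 0 & 0 & \dots \\ 0 & 0 & 0 & \sqrt{3} & 0 & \dots \\ 0 & 0 & 0 & 0 & \sqrt{4} & \dots \\ \vdots & & & & & \ddots \end{pmatrix} \tag{4.34} $$

The raising operator is a similar object:

$$ a^\dagger \leftrightarrow \begin{pmatrix} 0 & 0 & 0 & 0 & \dots \\ \sqrt{1} & 0 & 0 & 0 & \dots \\ 0 & \sqrt{2} & 0 & 0 & \dots \\ 0 & 0 & \sqrt{3} & 0 & \dots \\ \vdots & & & & \ddots \end{pmatrix} \tag{4.35} $$

In fact, this is just the Hermitian conjugate of the previous matrix - this just stems from
the fact that operators $a$ and $a^\dagger$ are Hermitian conjugate operators. If you perform the
(infinite) matrix product, you see that $H = \hbar\omega \left( a^\dagger a + \frac{1}{2} \right)$ should be written as

$$ H \leftrightarrow \hbar\omega \begin{pmatrix} 0 + \frac{1}{2} & 0 & 0 & 0 & \dots \\ 0 & 1 + \frac{1}{2} & 0 & 0 & \dots \\ 0 & 0 & 2 + \frac{1}{2} & 0 & \dots \\ 0 & 0 & 0 & 3 + \frac{1}{2} & \dots \\ \vdots & & & & \ddots \end{pmatrix} \tag{4.36} $$

There are several observations here. First, this is a Hermitian matrix. This is actually
due to the fact that $H$ is a Hermitian operator. Second, $H$ is a diagonal matrix. This
expresses that the basis we use consists of eigenstates of $H$. More than ever, the parallel
with linear algebra is clearly present - although here we work with infinitely big matrices.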
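
A computer cannot store infinitely big matrices, but a truncated version already shows the structure. The sketch below is an approximation under a stated assumption: only the first `N` states are kept, with `N = 6` chosen arbitrarily, and energies are measured in units of $\hbar\omega$. It builds the lowering matrix with $\sqrt{n}$ just above the diagonal, obtains the raising matrix as its transpose, and forms the Hamiltonian and the commutator of the two ladder matrices.

```python
import math

N = 6   # truncation size (assumption): keep only the states |0>, ..., |N-1>

def matmul(A, B):
    """Product of two N x N matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(N)) for j in range(N)] for i in range(N)]

# Lowering operator: the entry in row n-1, column n equals sqrt(n).
a = [[math.sqrt(j) if j == i + 1 else 0.0 for j in range(N)] for i in range(N)]
# Raising operator: the transpose (all entries are real, so no conjugation is needed).
a_dag = [[a[j][i] for j in range(N)] for i in range(N)]

number = matmul(a_dag, a)   # the number operator: diagonal with entries 0, 1, 2, ...
# Hamiltonian in units of hbar*omega: number operator plus one half on the diagonal.
H = [[number[i][j] + (0.5 if i == j else 0.0) for j in range(N)] for i in range(N)]

product = matmul(a, a_dag)
comm = [[product[i][j] - number[i][j] for j in range(N)] for i in range(N)]   # commutator

print([H[n][n] for n in range(N)])      # 0.5, 1.5, 2.5, ... up to rounding
print([comm[n][n] for n in range(N)])   # ones, except for the last entry
```

The last diagonal entry of the commutator is not one. That is an artifact of cutting the matrices off at size `N`: the state just above the truncation is missing, so the relation $[a, a^\dagger] = 1$ can only hold exactly for the infinite matrices.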
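Let us now consider a particular state $|\psi\rangle = |0\rangle + |1\rangle$:

$$ |\psi\rangle \leftrightarrow \begin{pmatrix} 1 \\ 1 \\ 0 \\ \vdots \end{pmatrix} \tag{4.37} $$

This state is not normalized however: $\langle\psi|\psi\rangle = 2$. So to normalize, we need to divide by a
constant:

$$ |\psi\rangle \leftrightarrow \begin{pmatrix} 1/\sqrt{2} \\ 1/\sqrt{2} \\ 0 \\ \vdots \end{pmatrix} \tag{4.38} $$

The state equally combines the first two energy levels, so if you measure the energy, you
expect that there is $\frac{1}{2}$ chance to detect an energy $E_0$, and $\frac{1}{2}$ for $E_1$. Indeed:

$$ P(E \to E_0) = |c_0|^2 = \frac{1}{2} = |c_1|^2 = P(E \to E_1) \tag{4.39} $$

Now say you measured the energy of the particle to equal $E_0$. The state then collapses
onto that level: $|\psi\rangle \to |\psi'\rangle = |0\rangle$ after the measurement. If you measure the energy again,
you are sure that you get $E_0$ as an answer again, because now

$$ P(E \to E_0) = |c'_0|^2 = 1 \tag{4.40} $$

and all the others are zero. Indeed, the wave function collapsed. Now what was the original
expectation value of the energy? One-half chance on either of the energies means that

$$ \langle E \rangle_\psi = \frac{E_0 + E_1}{2} = \hbar\omega, \tag{4.41} $$

just their average. Is this equal to the sandwich of $H$ inside $|\psi\rangle$? Yes it is,

$$ \langle\psi|H|\psi\rangle = \frac{1}{2} \left( \langle 0| + \langle 1| \right) H \left( |0\rangle + |1\rangle \right) $$

$$ = \frac{1}{2} \left( \langle 0| + \langle 1| \right) \left( E_0 |0\rangle + E_1 |1\rangle \right) $$

$$ = \frac{1}{2} (E_0 + E_1). \tag{4.42} $$

For the time evolution, we note that the time dependent versions of $|0\rangle$ and $|1\rangle$ are given
by

$$ e^{-iE_0 t/\hbar} |0\rangle \quad \text{and} \quad e^{-iE_1 t/\hbar} |1\rangle \tag{4.43} $$

such that

$$ |\psi(t)\rangle = \frac{1}{\sqrt{2}} e^{-iE_0 t/\hbar} |0\rangle + \frac{1}{\sqrt{2}} e^{-iE_1 t/\hbar} |1\rangle \tag{4.44} $$

is the time evolution of the initial state $|\psi\rangle$.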
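
The whole example fits in a short script. The sketch below is an illustration in assumed units with hbar = omega = 1, so that energies are measured in units of $\hbar\omega$; the function names and the time `t = 2.7` are arbitrary choices. It stores the normalized state as its two coefficients, lets each coefficient pick up its own phase $e^{-iE_n t/\hbar}$, and recomputes the energy probabilities and the expectation value at the later time.

```python
import cmath
import math

HBAR = OMEGA = 1.0                                  # assumed units, for convenience only
E = [HBAR * OMEGA * (n + 0.5) for n in range(2)]    # the two lowest energies E_0 and E_1
c = [1 / math.sqrt(2), 1 / math.sqrt(2)]            # coefficients of (|0> + |1>)/sqrt(2)

def evolve(c, t):
    """Multiply each coefficient by its own phase exp(-i E_n t / hbar)."""
    return [cn * cmath.exp(-1j * En * t / HBAR) for cn, En in zip(c, E)]

def energy_probabilities(c):
    """Probability |c_n|^2 of measuring E_n, for a normalized state."""
    return [abs(cn) ** 2 for cn in c]

def energy_expectation(c):
    """Probability-weighted average of the energies."""
    return sum(p * En for p, En in zip(energy_probabilities(c), E))

c_t = evolve(c, t=2.7)                               # an arbitrary later time

print(energy_probabilities(c), energy_expectation(c))
print(energy_probabilities(c_t), energy_expectation(c_t))
```

The phases change the coefficients but not their moduli, so the chance of one half for each energy and the average energy $\hbar\omega$ are the same at all times.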

A last remark: the above implies that the energy (or any other observable) of a particle
need not be sharply defined: a particle may be 'smeared out' over several values of this
observable. This is very similar to the fact that a wave function does not have a sharply
defined position: simply because it is spread out over some region in space. This does not
mean the quantum description is fuzzy or so: states of particles are well defined; it's just
the naive point particle idea (where position, energy, ... are all unambiguously defined)
which you need to get rid of.

4.3.2 Recap: the postulates of quantum mechanics. 

Some people like to summarize. Some people like to highlight. For those souls, get your 
sharpened markers out, we are about to summarize this and all previous chapters in just 
one section. Why will we do so? Well, the summary below is actually due to the pioneers
of quantum mechanics, who had been trying to understand the theory so hard that, at
the moment they got it all straight, they couldn't resist listing the set of rules they met on
the way. This list bears the pompous name 'the postulates of quantum mechanics'.
These postulates are true not only for the examples we have done so far, but still hold for 
all other examples and situations we will describe in this course. So, OK, maybe they did 
have some reason to pick such a solemn name. Here goes. 

When we wish to describe a particle quantum mechanically, we use the following rules: 
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Postulate 1 

All possible states form a complex vector space, the Hilbert space. Complex multiples 
of a state \tp) correspond to the same physical state, and we prefer working with the 
normalized version: (V'1'0) = 1- 

Postulate 2 

The time evolution of a state is given by the Schrödinger equation

$$ i\hbar\,\partial_t|\psi\rangle = H|\psi\rangle \tag{4.45} $$

where $H$ is the Hamiltonian.

Postulate 3 

Every observable (=measurable quantity of the system) corresponds to a Hermitian oper- 
ator. 

Postulate 4 

If the system is in a state that is an eigenstate of an observable $O$, with eigenvalue $\lambda$,
then measurement of the quantity corresponding to $O$ will yield $\lambda$ as an outcome. The
eigenstates $|\psi_i\rangle$ of an observable $O$ form an orthonormal basis of the Hilbert space, so
every state in the Hilbert space can be written as

$$ |\psi\rangle = \sum_i c_i|\psi_i\rangle. \tag{4.46} $$

For such a (general) state, the outcome of measuring $O$ is probabilistic. The chance that
the measuring device gives $\lambda_i$ as an answer is given by

$$ P_\psi(\lambda_i) = \frac{|c_i|^2}{\sum_j |c_j|^2} \tag{4.47} $$

(If the state is properly normalized, the denominator falls away (it's one) and the above
expression looks somewhat simpler.) After measuring $O$, the state of the system collapses
into the eigenstate corresponding to the measured eigenvalue.

$$ |\psi\rangle \;\longrightarrow\; \text{measuring } O \text{ results in some } \lambda_a \;\longrightarrow\; \text{state collapses into } |\psi_a\rangle \tag{4.48} $$

Postulate 5 

The probabilistic nature of the measurement means we can associate to a general state not
an exact value of an observable $O$, but merely an expectation value. This is the average
result one would obtain when measuring $O$ again and again, each time making sure to put
the system back in its original state. The expectation value is given by the sandwich of
the operator $O$ inside the state:

$$ \langle O\rangle_\psi = \frac{\langle\psi|O|\psi\rangle}{\langle\psi|\psi\rangle} \tag{4.49} $$

(Again, if the state is normalized, the denominator can be left out.)
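
Postulates 4 and 5 translate almost literally into a few lines of linear algebra. The sketch below uses a made-up 3x3 Hermitian matrix O and a made-up, deliberately unnormalized state psi; both are assumptions chosen only for illustration.

```python
import numpy as np

# Hypothetical observable on a 3-dimensional Hilbert space (Hermitian by construction).
O = np.array([[2.0, 1.0j, 0.0],
              [-1.0j, 2.0, 0.0],
              [0.0, 0.0, 5.0]])
assert np.allclose(O, O.conj().T)

# Hypothetical state, not normalized on purpose.
psi = np.array([1.0, 1.0j, 2.0])

# Columns of `eigenvectors` are the orthonormal eigenstates |psi_i>.
eigenvalues, eigenvectors = np.linalg.eigh(O)

# Expansion coefficients c_i = <psi_i|psi>, as in (4.46).
c = eigenvectors.conj().T @ psi

# Measurement probabilities, as in (4.47), denominator included.
probabilities = np.abs(c) ** 2 / np.sum(np.abs(c) ** 2)

# Expectation value as the 'sandwich' (4.49), denominator included.
expectation = (psi.conj() @ O @ psi).real / (psi.conj() @ psi).real

print(eigenvalues, probabilities, expectation)
```

The expectation value from the sandwich agrees with the weighted average of the eigenvalues, the sum of lambda_i times P(lambda_i), which is exactly what 'the average result of many measurements' means.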



4.3.3 Conclusion 

That's it! These are the laws of quantum mechanics. If you recognized all the above 
postulates (they all showed up earlier), and understood their meaning, congratulations. 
To get here was the hardest task of this course. From now on, we will use the developed 
framework to get more insight into the quantum-world. So we will make the things we 
have learned more concrete, and extend to more general situations. What does this last 
thing mean? Well, so far we have been considering a particle living in one dimension x. 
A real particle (as described in the next chapter, for example) lives in 3 dimensions, x, y, z. So
to be more precise, particle states are wave functions $\psi(x,y,z)$. Also, we have not yet
lived up to our promise to explain the structure of atoms. These and other things will be 
done in the next chapters. So some other things will be added, we will do more examples, 
but the framework (the postulates above) stays exactly the same. Even the most involved 
computations in particle physics are, in fact, a mere application of the above rules. Nice, 
no?

Exercises

1. Without looking at the text, try to recall as much of the postulates of quantum
mechanics as you can.

2. The number operator for a particle in a harmonic potential is a Hermitian operator.
What can you conclude about the eigenvalues of this operator? Is this indeed the
case?

3. For the harmonic oscillator, write out the operators $X$ and $P$ in (infinite) matrix
form. Compute the expectation values $\langle n|X|n\rangle$ and $\langle n|P|n\rangle$. Did you expect this
result? (Note that we 'computed' $\langle n|X|n\rangle$ already in the previous chapter. Your
computation here was a bit more precise.) By taking matrix products, find $\langle 0|XP|0\rangle$.

4. Write down your favorite quantum state of a particle in a harmonic potential that
is not an energy eigenstate. What is the expectation value of the energy for this
state?

5. Consider an observable $O$ (which does not involve time explicitly) and a state $|\psi(t)\rangle$
(which is not necessarily an eigenstate of $O$ or the Hamiltonian). Obviously, $\langle\psi|O|\psi\rangle$
may depend on time, since the state depends on time. Show that

$$ \partial_t\langle\psi(t)|O|\psi(t)\rangle = \frac{i}{\hbar}\langle\psi(t)|[H,O]|\psi(t)\rangle. $$

(Hint: you'll need the Schrödinger equation.) By taking $O$ to be the identity operator,
prove that the norm of a state does not change in time.

Chapter 5

The hydrogen atom 



In this chapter... 

One of the main motivations of the development of quantum mechanics, also mentioned 
in the introduction, was getting a better understanding of atoms and molecules. How can 
we explain the behavior of these crucial building blocks of nature? Nicely enough, we have
already collected the necessary pieces to start answering this puzzle: in this chapter, we 
will describe in detail the most elementary atom: hydrogen. 

We start with some more this-and-thats on Hilbert spaces, then comes the usual bite 
of (hi)story, and in the last part we get down to business - describing the hydrogen atom 
quantum mechanically. 




Figure 5.1: A naive depiction of the atom. This chapter, we will learn better.

5.1 Operators in three dimensions

5.1.1 3D sensation 

In the previous chapter, we used only one spatial variable, x, on which wave functions 
could depend. That describes particles living in one dimension only, so on a line. Doing 
so makes life easy, and such a description can even be quite good for systems where the 
y and z dependence is trivial. But what if one really wants to describe a wave function 
propagating in three dimensions? In such case, we need to let the wave function depend 
on three coordinates 

$$ \psi = \psi(x, y, z). \tag{5.1} $$

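So that is just a minor change. As there are more variables now, we need to be a bit more
careful with derivatives. To avoid confusion, we will write an index for each differential
operator:

$$ D_x = \frac{\partial}{\partial x}, \qquad D_y = \frac{\partial}{\partial y}, \qquad D_z = \frac{\partial}{\partial z}. \tag{5.2} $$

As the order of taking derivatives never matters, these differential operators all commute:

$$ [D_x, D_y] = \frac{\partial}{\partial x}\frac{\partial}{\partial y} - \frac{\partial}{\partial y}\frac{\partial}{\partial x} = 0 \tag{5.3} $$

and similarly $[D_x, D_z] = [D_y, D_z] = 0$. Now to each derivative, we can associate a
corresponding momentum operator:

$$ P_x = -i\hbar D_x, \qquad P_y = -i\hbar D_y, \qquad P_z = -i\hbar D_z. \tag{5.4} $$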

This makes sense, as for a particle in 3D you can measure three different components of
the momentum. Using the grad(ient) operator $\vec\nabla = (\partial_x, \partial_y, \partial_z)$, the above formula can
be written in vector form:

$$ \vec{P} = -i\hbar\vec\nabla \tag{5.5} $$

So here the left- and right-hand sides are vectors of operators. Some people are a bit lazy,
and drop the vector arrows on top. We will do so too, sometimes.

Experimentally, one can also measure the total size of the momentum of a particle
(regardless of its direction). Which operator corresponds to this observable? Classically,
the square of the total momentum is given by

$$ |\vec{p}|^2 = p_x^2 + p_y^2 + p_z^2 \tag{5.6} $$

This suggests that the corresponding quantum operator is given by

$$ P^2 = P_x^2 + P_y^2 + P_z^2 = -\hbar^2\nabla^2 = -\hbar^2(\partial_x^2 + \partial_y^2 + \partial_z^2) \tag{5.7} $$




Note that this is a single operator again, not a vector. Ah, so when going from one
to three dimensions, the operator for the momentum squared just becomes a grad(ient)
squared. This immediately suggests the form of the Schrödinger equation in 3D:

$$ i\hbar\,\partial_t\psi(\vec{x}, t) = -\frac{\hbar^2}{2m}\nabla^2\psi(\vec{x}, t) + V(\vec{x})\,\psi(\vec{x}, t) \tag{5.8} $$

The changes are: $\psi$ now depends on three coordinates $\vec{x}$, the same for the potential, and
the derivative squared is now a grad squared. Not all too different, right?

5.1.2 Angular momentum 

When passing from 1D to 3D, there is another thing that changes: in 3 dimensions, objects
can rotate. In the same way that linear momentum is the 'amount of motion', angular
momentum measures an object's 'amount of rotation'. The angular momentum is a vector
$\vec{l}$, and for a point particle with position $\vec{x}$ and momentum $\vec{p}$ it is given by:

$$ \vec{l} = \vec{x} \times \vec{p} \tag{5.9} $$

or writing out the cross product in components:

$$ \vec{l} = (y p_z - z p_y,\; z p_x - x p_z,\; x p_y - y p_x). \tag{5.10} $$

Again the corresponding quantum operator is a straightforward guess:

$$ \vec{L} = \vec{X} \times \vec{P} \tag{5.11} $$

where by $\vec{X}$ we mean the vector of the position operators $X$, $Y$ and $Z$. Writing out the
momentum operator, we get

$$ \vec{L} = -i\hbar\,\vec{X} \times \vec\nabla. \tag{5.12} $$

So for example the x component of the angular momentum operator is given by:

$$ L_x = Y P_z - Z P_y = -i\hbar(Y\partial_z - Z\partial_y) \tag{5.13} $$

Mini-example 

The function $f(x, y, z) = y + iz$ is an eigenfunction of $L_x$, with eigenvalue $\hbar$, since

$$ L_x f(x, y, z) = -i\hbar(yi - z) = \hbar(y + iz) = \hbar f \tag{5.14} $$

Of course, such a function is not a decent wave function: it is nowhere near normalizable. 
But well - it's just an example. 
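
If you would rather let a computer do the differentiating, the mini-example can be checked symbolically. This is only a sketch using the SymPy library; the helper name L_x is our own choice, and it simply applies the differential operator of (5.13) to an expression.

```python
import sympy as sp

x, y, z = sp.symbols("x y z", real=True)
hbar = sp.symbols("hbar", positive=True)


def L_x(expr):
    """Apply L_x = -i*hbar*(y d/dz - z d/dy) to an expression, as in (5.13)."""
    return -sp.I * hbar * (y * sp.diff(expr, z) - z * sp.diff(expr, y))


f = y + sp.I * z

# If f is an eigenfunction with eigenvalue hbar, this difference vanishes.
residual = sp.expand(L_x(f) - hbar * f)
print(residual)
```

The same helper can be pointed at other functions: for instance, anything that depends on x only is sent to zero by L_x, so it is an eigenfunction with eigenvalue 0.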
5.2 Story: cracking the mystery of atoms

The idea of atoms making up the basic constituents of nature is not so new. Even some
Greek philosophers thought of the possibility of fundamental, indivisible pieces of matter^1.
Almost two millennia later, the first scientifically motivated atomic theories came to life.
Through a long and fascinating history of discoveries, it became clear that there really have
to be basic constituents of all matter: atoms. First a mere model, more and more proofs
of their existence accumulated, up to their classification, and finally even the possibility
of 'looking' inside those strange building blocks. At the end of a long debate, scientists
got convinced that atoms have to be made of a small positive core, with a cloud of
electrons around it. For a moment, one could have thought that the mystery of atoms was
unraveled. Electric attraction indeed explains that such a cloud of electrons would stay
close to a positive core, orbiting around it. It could not be more beautiful, the smallest
known scale in almost perfect similarity to our very own solar system. However perfect
this may seem, there were some serious problems indicating that the mystery was not
completely solved yet.

5.2.1 The end of the world 

From your basic electromagnetism course, you know that opposite charges attract. Much like
gravitation keeps a planet circling around a star, a light negative charge would
neatly orbit around a heavy positive one. But there is a problem. The circular movement
means that the orbiting charge experiences an acceleration: the centripetal acceleration
provided by the electrostatic attraction. Now a more advanced course on electromagnetism (whether
you had one or not) will readily tell you that an accelerated charge necessarily radiates
away energy. Roughly speaking, the circular movement of the charge gives rise to a
changing electric field, in such a way that electromagnetic waves are created. This is
similar to a moving object in water: it creates waves that carry away energy from the
moving object. So an orbiting charge will gradually lose its energy, and is doomed to
end up (after some time) at the positive core. One could think this process just takes
long, and therefore does not occur on visible timescales, but it turns out that such a
collapse would go very quickly. So if the classical view on the electron and nucleus were
correct, atoms would be doomed to collapse quickly. Sounds very much like the end of the world,
which we luckily don't see, so something else has to be going on inside the atoms around
us.



^1 Actually, 'atom' comes from the Greek word for indivisible.

Figure 5.2: If electrons were point particles, they would lose all their energy by the emitted
radiation, and spiral down to the nucleus. In no time, all atoms would collapse, and the 
universe would end in a drastic collapse. Obviously, this is not what happens. 



5.2.2 Balmer and friends 

Nevertheless, there are some things that do make sense if you think of an atom in the 
'planetary' way. If electrons are orbiting around some core, you expect that they can 
change their kinetic energy. So if one invests energy in an electron, it will just move to 
a larger orbit. If it falls back to the smaller orbit, the absorbed energy will be emitted 
again. This suggests atoms can absorb and emit energy. And such was indeed observed. 
At the end of the 19th century, the pioneers in spectroscopy had found clear indications 
that some gases absorb light (so, energy) and emit it again. To the scientists working on 
the atomic model, it was clear that this absorption of energy had something to do with
the electrons of an atom absorbing it, and spitting it out again later. But there was a very
strange peculiarity. Gases did not seem to absorb just any radiation, but only radiation at
very specific wavelengths. For hydrogen gas, this behavior was put into an empirical formula
by Johann Balmer. He found that the absorbed wavelengths $\lambda_{abs}$ were all of the form

$$ \frac{1}{\lambda_{abs}} \propto \left(\frac{1}{2^2} - \frac{1}{n^2}\right) \quad\text{with } n \text{ some integer} > 2. \tag{5.15} $$

Other scientists later found some other absorbed wavelengths, and these were all of
the form

$$ \frac{1}{\lambda_{abs}} = R_H\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right) \quad\text{with } n_1 < n_2 \text{ integers.} \tag{5.16} $$

The above equation is called the Rydberg formula, after the discoverer of this generalization
of Balmer's formula. The number $R_H$ is the Rydberg constant. This formula
very strongly suggests that an electron in a hydrogen atom can only 'sit' on special
energy levels. To see that, first note that for light, the inverse wavelength is proportional
to the frequency. Indeed, the product of wavelength and frequency is the wave speed (light
speed):

$$ f\lambda = c \tag{5.17} $$

So because the speed of light $c$ is just a constant, we have $1/\lambda \propto f$. Next, the frequency
of a light quantum (a photon) is proportional to its energy: $E = hf$. So $1/\lambda_{abs}$ is really a
measure for an energy difference: the energy of a photon absorbed or emitted in a transition
between two electron orbits. So the Rydberg formula implies that the allowed energy differences
are given by

$$ \Delta E \propto \left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right) \quad\text{with } n_1 < n_2 \text{ positive integers.} \tag{5.18} $$

This then suggests the only energy levels occupied by the (hydrogen) electron are given
by

$$ E = -E_0\frac{1}{n^2} \quad\text{with } n \text{ positive and integer,} \tag{5.19} $$

with $E_0$ a constant. If you track down the constants involved in all proportionalities, you
can check that $E_0 = hcR_H$.
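
To get a feeling for the numbers, the Rydberg formula (5.16) is easy to evaluate. The sketch below assumes the approximate values R_H = 1.0968e7 per metre and hc = 1239.84 eV nm; the function names are our own.

```python
R_H = 1.0968e7        # Rydberg constant for hydrogen in 1/m (approximate value)
HC_EV_NM = 1239.84    # Planck constant times light speed, in eV*nm (approximate)


def wavelength_nm(n1, n2):
    """Wavelength predicted by the Rydberg formula (5.16), in nanometres."""
    if not 0 < n1 < n2:
        raise ValueError("need integers with 0 < n1 < n2")
    inverse_wavelength = R_H * (1 / n1**2 - 1 / n2**2)   # in 1/m
    return 1e9 / inverse_wavelength


def photon_energy_eV(n1, n2):
    """Energy difference between the two levels, using E = h f = h c / lambda."""
    return HC_EV_NM / wavelength_nm(n1, n2)


# Balmer's lines are the special case n1 = 2.
balmer_lines = {n: wavelength_nm(2, n) for n in range(3, 7)}

# The constant E_0 = h c R_H of (5.19), in eV.
E0_eV = HC_EV_NM * 1e-9 * R_H

print(balmer_lines)
print(E0_eV)
```

The first Balmer line comes out in the red part of the visible spectrum (roughly 656 nm), and E_0 lands close to 13.6 eV.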

5.2.3 Bohr 

This observed discreteness of the atomic spectrum of hydrogen (and other atoms) is of 
course not so easily explained by classical mechanics. The only situation in which physicists 
had been confronted with discrete amounts of energy was the photon-hypothesis. As 
explained in the introduction, the photoelectric effect (amongst other experimental results) 
showed that light has to consist of discrete packets, of which the energy is determined 
completely by the frequency of the light: 

$$ E = hf \;(= \hbar\omega). \tag{5.20} $$

In part inspired by this result, the physicist Niels Bohr cooked up the following explanation
for the energy spectrum of hydrogen. He suggested that electrons around an atomic
nucleus can only move on circular orbits, whose angular momentum is an integer multiple
of the (reduced) Planck constant $\hbar$. This assumption was never derived or even slightly motivated.
Professor Bohr just pulled it out of the hat. Any motivation (or lack of it) aside, let us try
to see what the consequence is of such an assumption. If we take the origin of the system
at the nucleus, the length of the position vector $\vec{r}$ of the electron is equal to the orbital
radius $r$. The momentum is of course given by $m\vec{v}$ with $m$ the electron mass, and $\vec{v}$ its
velocity. (Although the direction of the velocity changes throughout the circular motion,
its size remains the same.) Since the velocity is perpendicular to the position vector we
have $|\vec{r} \times \vec{p}| = |\vec{r}||\vec{p}|$ and the size of the angular momentum is given by

$$ l = |\vec{l}| = |\vec{r} \times \vec{p}| = rmv. \tag{5.21} $$




Figure 5.3: According to the Bohr model, electrons can move on specific orbits only. As 
a consequence, an electron falling from one energy level to another (here from n = 3 
to n = 2) can only emit a specific amount of energy. For the same reason, only specific 
amounts of energy can be absorbed by an atom. This explains the findings of spectroscopic 
experiments and the formula of Rydberg. 



So Bohr's hypothesis amounts to

$$ rmv = n\hbar \quad\text{with } n \text{ a positive integer.} \tag{5.22} $$



Which orbits satisfy this relation? First note that the centripetal force is given by the
attraction between the nucleus and the electron. Putting in the charge of the proton ($+e$)
and the electron ($-e$):

$$ \frac{mv^2}{r} = F_{centr} = F_{coul} = k\frac{e^2}{r^2} \tag{5.23} $$

So that

$$ r = \frac{ke^2}{mv^2} \tag{5.24} $$

for every circular orbit. This allows us to rewrite Bohr's hypothesis (5.22) as

$$ \frac{ke^2}{v} = n\hbar. \tag{5.25} $$

We are now getting close. The energy of the system is given by

$$ E = E_{kin} + E_{pot} = \frac{1}{2}mv^2 - k\frac{e^2}{r} = -\frac{1}{2}mv^2 \tag{5.26} $$

where in the last step we used (5.23). Plugging in Bohr's hypothesis (5.25), we find that
the energy of the electron is given by

$$ E = -\frac{1}{2}m\left(\frac{ke^2}{n\hbar}\right)^2 = -E_0\frac{1}{n^2} \tag{5.27} $$

with $n$ a positive integer and $E_0 = \frac{1}{2}m\left(\frac{ke^2}{\hbar}\right)^2$ a numerical constant with units of energy.
Woah, ye pirate, Bohr! Using his seemingly random hypothesis, we find a discrete set of
energies. And not just some set, precisely the one matching Rydberg's formula. Indeed,
it has the right $1/n^2$ form, and moreover the numerical value of $E_0$ turns out to coincide
sharply with the value $hcR_H$ obtained from the Rydberg constant $R_H$.
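
You can check the claimed numerical agreement yourself. The sketch below plugs rounded SI values of the constants (our assumption; look up more digits if you need them) into Bohr's formulas; the function names are our own.

```python
# Physical constants in SI units (rounded values, assumed for this sketch).
M_E = 9.1093837e-31          # electron mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C
K_COULOMB = 8.9875518e9      # Coulomb constant k, N m^2 / C^2
HBAR = 1.054571817e-34       # reduced Planck constant, J s


def orbit_speed(n):
    """Electron speed on the n-th Bohr orbit, from k e^2 / v = n hbar (5.25)."""
    return K_COULOMB * E_CHARGE**2 / (n * HBAR)


def orbit_radius(n):
    """Radius of the n-th Bohr orbit, from r m v = n hbar (5.22), in metres."""
    return n * HBAR / (M_E * orbit_speed(n))


def orbit_energy_eV(n):
    """Total energy E = -m v^2 / 2 of the n-th orbit (5.26), converted to eV."""
    return -0.5 * M_E * orbit_speed(n) ** 2 / E_CHARGE


for n in (1, 2, 3):
    print(n, orbit_radius(n), orbit_energy_eV(n))
```

For n = 1 this gives an energy of about -13.6 eV, and the energies for higher n scale as one over n squared, just as (5.27) says.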

5.2.4 A better explanation? 

That's all very nice of Bohr. The great trick he pulls is just assuming a discrete behavior
of the angular momentum, which then in the end of course leads to a discrete behavior
of the energy. However perfectly the final result matches experiment, it just shifts the
problem to another one: why is the angular momentum discrete, then? It also does not
answer the problem raised in the 'end of the world' section. The model still describes an
electron moving on circles, which should radiate away all its energy.

What can we do about that? Well, first of all we know that the electron should
be described by a smeared out wave function, not a circular orbit of a point particle.
In the examples we have met so far, solving the Schrödinger equation (SE) for such a wave
function leads to an eigenfunction problem. Moreover, in the examples we have seen, this
eigenfunction problem only had a special set of solutions. So a wave function 'locked up' in
an attractive potential naturally has a discrete set of energy states. This is a much more
satisfying answer than the magic Bohr pulled out of his hat. But can we also reproduce this result
here? That is to say, if we solve the SE for the hydrogen atom, do we get the right spectrum?
We will see in the next section...

5.3 The quantum approach

Let us try to apply what we have learned in the previous chapters, and understand the 
hydrogen atom using quantum mechanics. We know that the nucleus is very small, and 
that it creates a potential for the bound electron: 

$$ V = -k\frac{e^2}{r}. \tag{5.28} $$

With $r = \sqrt{x^2 + y^2 + z^2}$ the distance to the origin. This means the Schrödinger equation
for the electron wave function reads

$$ i\hbar\,\partial_t\psi = \left(V - \frac{\hbar^2}{2m}\Delta\right)\psi \tag{5.29} $$

Here $V = -ke^2/\sqrt{x^2 + y^2 + z^2}$ is the potential, $\psi = \psi(x, y, z)$ the three-dimensional wave
function, and $\Delta = \partial_x^2 + \partial_y^2 + \partial_z^2$ the Laplacian. This gives a first situation where we have
to deal with a realistic (and fully 3D) problem. In principle, the only thing one needs to
do to understand the quantum behavior of an electron in a hydrogen atom is to solve the
above equation. Again, we first look for energy eigenstates. The time evolution of these
states is trivial ($e^{-iEt/\hbar}$), and the most general state of the Hilbert space is just given by
taking linear combinations. So the only real problem is solving:

$$ E\psi = \left(V - \frac{\hbar^2}{2m}\Delta\right)\psi. \tag{5.30} $$

Phrased differently: we want to find the eigenfunctions of the Hamiltonian $H = V - \frac{\hbar^2}{2m}\Delta$.

5.3.1 A better coordinate system 

The above equation is not so easy to solve. The naughty element is the potential, which 
involves a square root. Actually, since the potential (and hence the entire problem) is 
spherically symmetric, it is natural to switch to an adapted coordinate frame. Spherical 
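coordinates $(r, \theta, \varphi)$ are related to Cartesian ones by

$$ x = r\cos\varphi\sin\theta \tag{5.31} $$
$$ y = r\sin\varphi\sin\theta \tag{5.32} $$
$$ z = r\cos\theta \tag{5.33} $$

or

$$ r = \sqrt{x^2 + y^2 + z^2} \tag{5.34} $$
$$ \varphi = \arctan(y/x) \tag{5.35} $$
$$ \theta = \arccos\left(\frac{z}{\sqrt{x^2 + y^2 + z^2}}\right) \tag{5.36} $$

The ranges are $r \in [0, \infty[$, $\theta \in [0, \pi]$ and $\varphi \in [0, 2\pi[$. If we want to rewrite the energy
eigenstate equation (5.30) in terms of $(r, \theta, \varphi)$, three things change. First, we need
to replace $\psi(x, y, z)$ by $\psi(r, \theta, \varphi)$. Next, the potential just becomes $-ke^2/r$ instead of
$-ke^2/\sqrt{x^2 + y^2 + z^2}$. The last thing is to rewrite the Laplacian in terms of derivatives $\partial_r$,
$\partial_\theta$ and $\partial_\varphi$. Doing so is unusually nasty (not even fun) and we only sketch the procedure
here. For example, to express $\partial_z$, we remark that

$$ \partial_z = \frac{\partial}{\partial z} = \frac{\partial r}{\partial z}\partial_r + \frac{\partial\theta}{\partial z}\partial_\theta + \frac{\partial\varphi}{\partial z}\partial_\varphi \tag{5.37} $$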

This is just the chain rule for derivatives. (Both sides are operators, so one should imagine
them acting on a function. Writing a function on the right of both sides, you indeed see
nothing but the chain rule.) You can then compute the three partial derivatives occurring
on the right-hand side, using the expressions for $r(x,y,z)$, $\theta(x,y,z)$ and $\varphi(x,y,z)$, and
next express the results back in spherical coordinates. This way you can obtain

$$ \partial_z = \cos\theta\,\partial_r - \frac{\sin\theta}{r}\,\partial_\theta \tag{5.38} $$

Now to get $\partial_z^2$, one must be a bit careful. Once again, recall that we are dealing with
operators. So for example the square of an operator $(r\partial_r)$ would be

$$ (r\partial_r)^2 = (r\partial_r)(r\partial_r) = r(\partial_r + r\partial_r^2) \neq r^2\partial_r^2 \tag{5.39} $$

If you are confused by these steps, just write a general function $f$ on the right of every
term. In the same fashion, one can obtain an expression for $\partial_z^2$. (We omit it here.)
Doing the same for $\partial_x^2$ and $\partial_y^2$ and taking the sum, one obtains the Laplacian in spherical
coordinates:

$$ \Delta = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial\varphi^2} \tag{5.40} $$
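
Since we skipped the algebra, a symbolic spot check may be reassuring. The sketch below (using SymPy; the test function g is an arbitrary choice of ours) first redoes the operator-ordering example (5.39) on a general function f(r), and then compares the Cartesian Laplacian with the spherical expression (5.40) on g. It is a check on one example, not a proof.

```python
import sympy as sp

r, theta, phi = sp.symbols("r theta phi", positive=True)
x, y, z = sp.symbols("x y z", real=True)

# --- Operator ordering, as in (5.39): let the operators act on a general f(r). ---
f = sp.Function("f")(r)


def r_dr(expr):
    """Apply the operator r * d/dr to an expression."""
    return r * sp.diff(expr, r)


square_applied = sp.expand(r_dr(r_dr(f)))        # (r d_r)^2 acting on f
naive_guess = r**2 * sp.diff(f, r, 2)            # r^2 d_r^2 acting on f
extra_term = sp.simplify(square_applied - naive_guess)   # what the naive guess misses


# --- Spot check of the spherical Laplacian (5.40). ---
def laplacian_cartesian(expr):
    return sp.diff(expr, x, 2) + sp.diff(expr, y, 2) + sp.diff(expr, z, 2)


def laplacian_spherical(expr):
    radial = sp.diff(r**2 * sp.diff(expr, r), r) / r**2
    polar = sp.diff(sp.sin(theta) * sp.diff(expr, theta), theta) / (r**2 * sp.sin(theta))
    azimuthal = sp.diff(expr, phi, 2) / (r**2 * sp.sin(theta) ** 2)
    return radial + polar + azimuthal


# Cartesian coordinates expressed in spherical ones, (5.31)-(5.33).
to_spherical = {
    x: r * sp.cos(phi) * sp.sin(theta),
    y: r * sp.sin(phi) * sp.sin(theta),
    z: r * sp.cos(theta),
}

g = x * z**2 + y**3                                # hypothetical test function
lhs = laplacian_cartesian(g).subs(to_spherical)    # differentiate first, then convert
rhs = laplacian_spherical(g.subs(to_spherical))    # convert first, then differentiate
difference = sp.simplify(lhs - rhs)

print(extra_term)
print(difference)
```

The extra term is exactly the r times f' piece that the naive square forgets, and the two Laplacians agree on the test function.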



Back to the equation 

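After this sidestep, let us move on to what we are here for, solving

$$ E\psi(r, \theta, \varphi) = \left(V(r) - \frac{\hbar^2}{2m}\Delta\right)\psi(r, \theta, \varphi) \tag{5.41} $$

with $\Delta$ given by (5.40). Let us suppose that the wave function $\psi(r, \theta, \varphi)$ can be written
as

$$ \psi(r, \theta, \varphi) = R(r)F(\theta, \varphi) \tag{5.42} $$

This is called separation of variables. In terms of $R$ and $F$, the energy eigenstate
equation (5.41) becomes (after multiplying by $-\frac{2m}{\hbar^2}r^2$)

$$ F\left(\frac{\partial}{\partial r}\left(r^2\frac{\partial R}{\partial r}\right) - \frac{2m}{\hbar^2}r^2\,(V(r) - E)\,R\right) \tag{5.43} $$
$$ {} + R\left(\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial F}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2 F}{\partial\varphi^2}\right) = 0 \tag{5.44} $$

This looks pretty disastrous, but there is an important simplification. Formally, the above
says

$$ F\,(\text{some function of } r) + R\,(\text{some function of } \theta \text{ and } \varphi) = 0 \tag{5.45} $$

So assuming $R$ and $F$ are not zero (a typical kind of physics sloppiness):

$$ \frac{(\text{some function of } r)}{R} = -\frac{(\text{some function of } \theta \text{ and } \varphi)}{F}. \tag{5.46} $$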






The last equation is very special. On the left-hand side things only depend on $r$, whereas
the right-hand side depends only on $\theta$ and $\varphi$. This can only be true if both sides are equal to
some constant, say $C$. This then reduces our problem to solving



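$$ \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial F}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2 F}{\partial\varphi^2} = -CF \tag{5.47} $$

$$ r^2\frac{d^2R}{dr^2} + 2r\frac{dR}{dr} - \frac{2m}{\hbar^2}r^2\,(V(r) - E)\,R = CR \tag{5.48} $$

Ah, we have broken the original equation down into two parts! That's good news. The first
part is called the angular equation, the second is the radial equation. We will solve
them separately.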



5.3.2 The angular equation 

For those who were masochistic enough to have done the Laplacian calculation, another
goodie. Verify that the square of the angular momentum operator, defined in the beginning
of the chapter, has the following form in spherical coordinates:

$$ L^2 = L_x^2 + L_y^2 + L_z^2 = -\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}\right] \tag{5.49} $$

If you have better things to do in your life than performing tedious algebra, just believe
the above statement. It will leave you with more energy to discover something special: the
angular equation (5.47) is just an eigenvalue equation of the angular momentum operator:

$$ L^2F = \hbar^2CF \tag{5.50} $$



Now this equation turns out to be well-studied in physics. It shows up in virtually any
problem with spherical symmetry (quantum or not). It only has solutions for the following
values of $C$:

$$ C = \ell(\ell + 1) \quad\text{where } \ell = 0, 1, 2, \ldots \tag{5.51} $$

Strikingly enough, for each such $C$ (except for $C = 0$), there are several solutions. So to be
clear on which solution we are talking about, not only do we need to specify $\ell$, but also an
extra label distinguishing the different solutions corresponding to that $\ell$. It turns out that
the different solutions corresponding to the same $\ell$ are all eigenfunctions of the operator
$L_z$, but with different eigenvalues. This means we can use the eigenvalue with respect to
$L_z$ as the extra label needed to specify the solution uniquely. This label is conventionally
called $m$, although it has nothing to do with mass. The corresponding eigenvalues of $L_z$
are $\hbar m$. In conclusion: the angular equation $L^2F = \hbar^2CF$ has an infinite set of solutions,
denoted $Y_\ell^m(\theta, \varphi)$, where the labels $\ell$ and $m$ are related to the eigenvalues of the solution
with respect to $L^2$ and $L_z$:

$$ L^2Y_\ell^m = \hbar^2\ell(\ell + 1)Y_\ell^m \tag{5.52} $$

and

$$ L_zY_\ell^m = \hbar m\,Y_\ell^m \tag{5.53} $$

It turns out that the values $m$ can take are bounded by $\pm\ell$: so $m = 0, \pm1, \pm2, \ldots, \pm\ell$. The
functions $Y_\ell^m(\theta, \varphi)$ are called spherical harmonics. To see where this name comes from:
a function that behaves 'well' under the Laplacian is called harmonic, and $L^2$ is just the
angular (= spherical) part of the Laplacian.

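The general form of the spherical harmonics $Y_\ell^m(\theta, \varphi)$ can be written down explicitly.
This expression is not so illuminating, so we just restrict to giving a table of the first few
solutions (smallest $\ell$ and $m$):

$\ell$    $m$       $Y_\ell^m(\theta, \varphi)$
0         0         $\sqrt{\frac{1}{4\pi}}$
1         0         $\sqrt{\frac{3}{4\pi}}\cos\theta$
1         $\pm1$    $\mp\sqrt{\frac{3}{8\pi}}\sin\theta\,e^{\pm i\varphi}$
2         0         $\sqrt{\frac{5}{16\pi}}(3\cos^2\theta - 1)$
2         $\pm1$    $\mp\sqrt{\frac{15}{8\pi}}\cos\theta\sin\theta\,e^{\pm i\varphi}$
2         $\pm2$    $\sqrt{\frac{15}{32\pi}}\sin^2\theta\,e^{\pm 2i\varphi}$

You also see that $m$ indeed runs from $-\ell$ to $+\ell$. Next, all the spherical harmonics have
a simple $\varphi$-dependence, of the form $e^{im\varphi}$. This guarantees they are eigenfunctions of
$L_z = -i\hbar\partial_\varphi$ with eigenvalue $\hbar m$.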
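
If you do not feel like checking the table by hand, a symbolic sketch can do it. The code below (SymPy again) types in the table entries with the upper sign of m and applies the operators (5.49) and L_z to them; the overall sign of each entry is a convention and does not affect the eigenvalue equations.

```python
import sympy as sp

theta, phi = sp.symbols("theta phi", real=True)
hbar = sp.symbols("hbar", positive=True)


def L_squared(F):
    """Apply the operator (5.49) to an expression F(theta, phi)."""
    polar = sp.diff(sp.sin(theta) * sp.diff(F, theta), theta) / sp.sin(theta)
    azimuthal = sp.diff(F, phi, 2) / sp.sin(theta) ** 2
    return -hbar**2 * (polar + azimuthal)


def L_z(F):
    """Apply L_z = -i*hbar*d/dphi to an expression F(theta, phi)."""
    return -sp.I * hbar * sp.diff(F, phi)


# Table entries, keyed by (l, m), upper signs only.
harmonics = {
    (0, 0): sp.sqrt(1 / (4 * sp.pi)),
    (1, 0): sp.sqrt(3 / (4 * sp.pi)) * sp.cos(theta),
    (1, 1): -sp.sqrt(3 / (8 * sp.pi)) * sp.sin(theta) * sp.exp(sp.I * phi),
    (2, 0): sp.sqrt(5 / (16 * sp.pi)) * (3 * sp.cos(theta) ** 2 - 1),
    (2, 1): -sp.sqrt(15 / (8 * sp.pi)) * sp.cos(theta) * sp.sin(theta) * sp.exp(sp.I * phi),
    (2, 2): sp.sqrt(15 / (32 * sp.pi)) * sp.sin(theta) ** 2 * sp.exp(2 * sp.I * phi),
}

# For each entry: what is left of (5.52) and (5.53) after moving everything to one side.
residuals = {
    (l, m): (
        sp.simplify(L_squared(Y) - hbar**2 * l * (l + 1) * Y),
        sp.simplify(L_z(Y) - hbar * m * Y),
    )
    for (l, m), Y in harmonics.items()
}

for label, pair in residuals.items():
    print(label, pair)
```

If the table is right, every residual should reduce to zero.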



5.3.3 The radial equation 

Having solved the angular equation, let us move on to the radial part. Using $C = \ell(\ell + 1)$
we can write the radial equation as

$$ \frac{d^2R}{dr^2} + \frac{2}{r}\frac{dR}{dr} + \frac{2}{ar}R - \frac{\ell(\ell + 1)}{r^2}R - b^2R = 0 \tag{5.54} $$

with $a = \frac{\hbar^2}{mke^2}$ and $-b^2 = \frac{2mE}{\hbar^2}$. Note that $-b^2$ is proportional to the energy $E$.
Classically, an electron is bound to an attractive potential if its total energy (kinetic + potential)
is negative. This is also the case in quantum mechanics. So we really are interested in
solutions where $E$ is negative, meaning $b$ is a real constant. For large $r$ the middle terms
are small and the equation becomes

$$ \frac{d^2R}{dr^2} = b^2R \tag{5.55} $$

So $R \propto e^{\pm br}$. The solution with a plus sign is a bit pathetic: the wave function would
then blow up for large $r$. We conclude there is only one healthy asymptotic behavior:
$R \propto e^{-br}$. This gives a hint on what solutions to the full equation (5.54) might look like:

$$ R(r) = (\text{some function of } r)\cdot e^{-br} \tag{5.56} $$

Concretely, by defining

$$ R(r) = \frac{u(r)}{r}\,e^{-br} \tag{5.57} $$

the radial equation becomes

$$ \frac{d^2u}{dr^2} - 2b\frac{du}{dr} + \left(\frac{2}{ar} - \frac{\ell(\ell + 1)}{r^2}\right)u = 0. \tag{5.58} $$



Now what kind of solutions do we want? We need $u(r)$ to be a function of $r$, but it should
be decent enough, so that the asymptotic behavior is still dominated by the exponential.
Studying the asymptotic behavior more carefully, one can show that $u$ has to be a polynomial.
(Otherwise, $R(r)$ will blow up for large $r$.) So the problem of solving the radial
equation has been reduced to: 'what polynomials satisfy the equation above?' Ah, again
this is a well-understood problem for mathematicians. Don't you just love them? The
problem only has solutions when

$$ \frac{1}{ab} = n \quad\text{with } n \text{ a positive integer.} \tag{5.59} $$

Also, we need $\ell + 1 \le n$ for a solution to exist. For these special values of $a$ and $b$, the
solution of (5.58) is given by

$$ u(r) = r^{\ell+1}L^{2\ell+1}_{n-\ell-1}(2br) \tag{5.60} $$

with $L^\alpha_\beta$ the so-called generalised Laguerre polynomials. They are defined by

$$ L^\alpha_\beta(x) = \frac{x^{-\alpha}e^{x}}{\beta!}\frac{d^\beta}{dx^\beta}\left(e^{-x}x^{\alpha+\beta}\right) \tag{5.61} $$

and the first relevant combinations become:

$n$    $\ell$    $r^{\ell+1}L^{2\ell+1}_{n-\ell-1}(2br)$
1      0         $r$
2      0         $r(-2br + 2)$
2      1         $r^2$
3      0         $r(2b^2r^2 - 6br + 3)$
3      1         $r^2(-2br + 4)$
3      2         $r^3$

Again, you can check for your favorite element of the above list that it satisfies the
corresponding differential equation. Returning to the original function $R$, the solution
corresponding to a particular value of $n$ and $\ell$ is

$$ R(r) = \frac{1}{r}\,r^{\ell+1}L^{2\ell+1}_{n-\ell-1}(2br)\,e^{-br} = r^{\ell}L^{2\ell+1}_{n-\ell-1}(2br)\,e^{-br} \tag{5.62} $$
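
Checking 'your favorite element' is a nice job for a computer algebra system. The sketch below builds the Laguerre polynomials directly from the definition (5.61) (so it does not rely on any built-in special function), forms u(r) as in (5.60) with b = 1/(na), and plugs it into the equation (5.58). The helper names are our own.

```python
import sympy as sp

r, a = sp.symbols("r a", positive=True)
t = sp.symbols("t", positive=True)


def laguerre(alpha, beta, argument):
    """Generalised Laguerre polynomial L^alpha_beta(argument), built from (5.61)."""
    derivative = sp.exp(-t) * t ** (alpha + beta)
    for _ in range(beta):                      # take the derivative beta times
        derivative = sp.diff(derivative, t)
    poly = sp.expand(sp.simplify(t ** (-alpha) * sp.exp(t) * derivative / sp.factorial(beta)))
    return poly.subs(t, argument)


def u(n, l):
    """The polynomial part (5.60), with b fixed by the condition 1/(a b) = n."""
    b = 1 / (n * a)
    return r ** (l + 1) * laguerre(2 * l + 1, n - l - 1, 2 * b * r)


def residual(n, l):
    """Left-hand side of (5.58) evaluated on u(n, l); zero for a true solution."""
    b = 1 / (n * a)
    U = u(n, l)
    lhs = sp.diff(U, r, 2) - 2 * b * sp.diff(U, r) + (2 / (a * r) - l * (l + 1) / r**2) * U
    return sp.simplify(lhs)


for n in range(1, 4):
    for l in range(n):
        print(n, l, sp.factor(u(n, l)), residual(n, l))
```

Replacing b = 1/(na) by any other value in the helper functions makes the residual nonzero, which is the quantization condition (5.59) at work.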




5.3.4 Conclusion 

Congratulations! You have gotten through solving the radial and angular part of the
Schrödinger equation of the hydrogen atom. The conclusion is: there is an infinite set of
solutions, labeled by the numbers $n$, $\ell$ and $m$. They are given by

$$ \psi_{n,\ell,m}(r, \theta, \varphi) = R(r)F(\theta, \varphi) \tag{5.63} $$
$$ {} = N\,r^{\ell}L^{2\ell+1}_{n-\ell-1}(2br)\,e^{-br}\,Y_\ell^m(\theta, \varphi). \tag{5.64} $$

where

$$ n = 1, 2, 3, \ldots \tag{5.65} $$
$$ \ell = 0, 1, \ldots, n-1 \tag{5.66} $$
$$ m = -\ell, -\ell+1, \ldots, \ell-1, \ell \tag{5.67} $$

A constant $N$ has been included to take account of the normalization of these solutions.
You may wonder what those numbers $n$, $\ell$ and $m$ mean physically. Because the operators
$L^2$ and $L_z$ involve only the angular coordinates, equations (5.52) and (5.53) also imply that

$$ L^2\psi_{n,\ell,m} = \hbar^2\ell(\ell + 1)\,\psi_{n,\ell,m} \tag{5.68} $$
$$ L_z\psi_{n,\ell,m} = \hbar m\,\psi_{n,\ell,m} \tag{5.69} $$

Hence the state $|\psi_{n,\ell,m}\rangle$ has (squared) angular momentum $\hbar^2\ell(\ell + 1)$, and the z-component
of the angular momentum is $\hbar m$. The label $\ell$ is called the orbital quantum number
and $m$ is the magnetic quantum number. Furthermore, using the expressions for $b$
and $a$, the energy is given by

$$ E = -\frac{\hbar^2b^2}{2m} = -\frac{\hbar^2}{2ma^2n^2} = -\frac{mk^2e^4}{2\hbar^2n^2} \tag{5.70} $$

(here $m$ is the electron mass, not the magnetic quantum number). Defining again
$E_0 = \frac{mk^2e^4}{2\hbar^2}$ ($\approx 13.6$ eV), the energy is given by

$$ E = -E_0\frac{1}{n^2}. \tag{5.71} $$

So the label n determines the energy. It is called the principal quantum number. 
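
The ranges (5.65)-(5.67) and the energy formula (5.71) are easy to tabulate. The sketch below lists all allowed labels for a given n; the value 13.6 eV for E_0 is the rounded number quoted above, and the function names are our own.

```python
E0_EV = 13.6   # rounded value of E_0 in electronvolts


def states(n):
    """All labels (n, l, m) allowed by (5.65)-(5.67) for a given n."""
    return [(n, l, m) for l in range(n) for m in range(-l, l + 1)]


def energy_eV(n):
    """Energy of every state with principal quantum number n, from (5.71)."""
    return -E0_EV / n**2


for n in range(1, 4):
    print(n, energy_eV(n), len(states(n)), states(n))
```

Counting the entries shows that there are n squared different states for each value of n, all sharing the same energy.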

Although it took us quite a bit of work to get here, this is a very beautiful result. 
Not only have we found the wave functions that can be taken by the electron of the 
hydrogen atom, we have also found the corresponding values of the energy. On top of 
that, these energies precisely coincide with the measured energy levels. (Since the result 
is in accordance with the empirical formula of Rydberg.) In contrast to Bohr, we did not 
have to make some funny assumptions. Also, these wave functions just sit there: they 
are not an orbiting point particle as Bohr assumed, which had the problem of inevitably 
radiating away its energy and ending up in the nucleus. So in some sense the quantum
approach also solves the 'end of the world' problem. More precisely: there is a state ($n = 1$
and $\ell = m = 0$) that has the lowest energy: this is the ground state. When the electron
is in this state, there is no possible process taking it to a lower energy state. The system 
thus is stable, and cannot decay or collapse. This contrasts with the numerous plagues 
one encounters when trying to describe the atomic structure using classical mechanics. 

Closing remarks 

We have obtained our desired solutions, but it might be clarifying to spend some more 
time on them. First, as you may know, the different possible states of the electron in the 
atom are called orbitals. There is an alternative notation to indicate the principal and 
orbital quantum number of each orbital: write the value of $n$ and then the letter s, p, d,
f, ... for $\ell = 0, 1, 2, 3, \ldots$. So for example

$$ 1s \leftrightarrow n = 1,\ \ell = 0 \tag{5.72} $$
$$ 2s \leftrightarrow n = 2,\ \ell = 0 \tag{5.73} $$
$$ 2p \leftrightarrow n = 2,\ \ell = 1 \tag{5.74} $$
$$ \vdots \tag{5.75} $$

etcetera. The strange letters indicating the orbital quantum number derive from some
purely visual aspects in spectroscopic experiments: they stand for s(harp), p(rincipal),
d(iffuse), f(undamental). Here are some figures showing the norm of the wave function
for the lowest energy states of the hydrogen atom. The brighter the color, the larger the
amplitude.

Some very nice things can be seen here. First, note that the higher the energy, the
further away the wave function is located from the nucleus. Indeed, for n = 3, the 
wave functions are more spread out ('larger') than for n = 2 or n = 1. This is the 
quantum analog of the classical expectation that for higher energies, the electron should 
be occupying a larger orbit. Let us be a bit more quantitative on the size of these orbitals. 
Using the fact that $L^{\alpha}_{\beta}$ is a polynomial of degree $\beta$, you can check that the leading term
in the wave functions (5.64) is given by

$$\psi_{n\ell m} \sim r^{n-1} e^{-r/(na)} \tag{5.76}$$

where we have used $b = \frac{1}{na}$. The function on the right falls off quickly due to the
exponential. The region over which it has an appreciable size can be estimated by looking at
the location of its maximum. There always is one, and you can check that it is given by

$$\frac{d}{dr}\left(r^{n-1} e^{-r/(na)}\right) = 0 \quad\Rightarrow\quad r = n^2 a - na \tag{5.77}$$

So we see that for larger n the wave function is approximately limited to a region of size

$$\text{state size} \sim n^2 a \tag{5.78}$$

The constant a is called the Bohr radius. Its numerical value is

$$a = 0.529 \cdot 10^{-10}\ \text{m} \tag{5.79}$$

and this is indeed the approximate size of the hydrogen atom. (Other atoms are larger
since they contain more electrons.) So the size of the wave function grows quadratically
with n. This explains to some extent the stunning success of Bohr's naive model. His
mysterious assumption describes classical orbits, of which the radii just happen to coincide
with the actual size of the quantum wave function.* On top of that, the classical energy
corresponding to such an orbit just so happens to coincide with the right (quantum
mechanical) value. For more complicated systems (atoms with more than one electron, for
example) there are no similar 'smart guesses', and a quantum description is the only tool
to get the right predictions.
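
The location of the maximum in (5.77) is easy to check numerically. The following is only a sketch in plain Python: the function names, the grid and the choice of units ($a = 1$) are illustrative assumptions, not part of the course. It scans the leading radial behaviour $r^{n-1} e^{-r/(na)}$ on a grid and compares the position of the largest value with $n^2 a - na$.

```python
import math


def radial_leading_term(r, n, a=1.0):
    """Leading behaviour r^(n-1) * exp(-r/(n a)) of the hydrogen wave function."""
    return r ** (n - 1) * math.exp(-r / (n * a))


def location_of_maximum(n, a=1.0, steps=200_000):
    """Brute-force search for the maximum on a grid from 0 to 4 n^2 a."""
    r_end = 4 * n * n * a
    best_r, best_value = 0.0, -1.0
    for i in range(steps + 1):
        r = r_end * i / steps
        value = radial_leading_term(r, n, a)
        if value > best_value:
            best_r, best_value = r, value
    return best_r


if __name__ == "__main__":
    for n in range(1, 6):
        # Columns: n, numerical maximum, predicted value n^2 a - n a (with a = 1).
        print(n, location_of_maximum(n), n * n - n)
```

For large n the two terms in $n^2 a - na$ differ by a factor n, which is why the estimate 'state size $\sim n^2 a$' keeps only the first one.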

One other observation is that the solution we found predicts that there are in general
several states with the same energy. (Since the energy only depends on $n$, not on $\ell$ or
$m$.) This phenomenon is called degeneracy and finally is something that really contrasts
with the Bohr description. It is possible to do very precise measurements and see the
different states corresponding to the same energy. This again confirms that the quantum
mechanical description really is the right one.
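
To get a feel for how many states share one energy, one can simply list them. The sketch below assumes the usual ranges $\ell = 0, \ldots, n-1$ and $m = -\ell, \ldots, \ell$, ignores spin (which has not been introduced here), and uses hypothetical helper names. It also attaches the spectroscopic label (1s, 2p, ...) to each pair $(n, \ell)$.

```python
# Spectroscopic letters for l = 0, 1, 2, 3, ... (the list is cut off after h).
LETTERS = "spdfgh"


def states(n):
    """All (n, l, m) triples belonging to the energy level n."""
    return [(n, l, m) for l in range(n) for m in range(-l, l + 1)]


def orbital_name(n, l):
    """Spectroscopic label such as '1s' or '2p'."""
    return f"{n}{LETTERS[l]}"


if __name__ == "__main__":
    for n in range(1, 5):
        labels = sorted({orbital_name(n, l) for (_, l, _) in states(n)})
        # The number of states with the same energy turns out to be n^2.
        print(n, len(states(n)), labels)
```

The count per level is $1 + 3 + 5 + \ldots + (2n - 1) = n^2$.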

Finally, some comments on the angular part of the wave function. You see that for
increasing $\ell$ the spherical harmonics get more and more wobbly: higher and higher powers
of $\sin\theta$ and $\cos\theta$ occur. This trend continues for greater $\ell$. Also the $\varphi$ dependence $e^{im\varphi}$
becomes more and more oscillating. This is the quantum mechanical manifestation of
angular momentum. The more strongly a wave function varies along the angular coordinates,
the more angular momentum it contains. This is a bit similar to the linear momentum of
a state. Since $P = -i\hbar\nabla$, the momentum of a state is proportional to its variation along
the coordinate axes, its linear 'wobbliness'. Of course, these statements are a bit shady:
they only give a heuristic feel for the manifestation of angular and linear momentum in
the language of wave functions. The right way to do this is of course to look at the
eigenvalues (or otherwise the expectation values) of all these operators.

*Just combine (5.24) and (5.25) to get an expression for $r$, only involving $n$ and some constants, but
not $v$. You get precisely the relation $r = n^2 a$.

You can now say you truly understand what the hydrogen atom is. Historically, the
fact that its structure can be fully derived from the Schrödinger equation without any
extra assumptions was a very important sign that his equation (and quantum mechanics)
are the right way to describe particles.

Exercises

1. What is the wavelength a photon should have to excite a hydrogen atom from the 
third to the fourth level? 

2. Pretend an atom has a cubic shape, and that you can stack hydrogen atoms tightly 
next to each other. How many atoms then fit in a cubic centimeter? Estimate the 
number of sand particles in the Sahara desert. (Surface: ~ 2500 km x 4000 km, 
depth ~ 100 m.) Would you say an atom is pretty small or very small? 

3. Using (5.49) and the fact that $L_z = -i\hbar\partial_\varphi$ in spherical coordinates, check for your
favorite element of the table of spherical harmonics that it is an eigenfunction of $L^2$
and $L_z$, with the right eigenvalues.

4. Verify the transition of (5.54) to (5.58).

5. If you bombard the electron with too much energy, it will be kicked away from the
hydrogen nucleus. Concretely, the states with high n are only very weakly bound, and
they are very close to an ionized state. How much energy do you need to kick an
electron from $n = 1$ to the 'highest state' $n = \infty$? If you impact a 20 eV photon on
a hydrogen atom, how much kinetic energy will the (now free) electron have?

6. Challenge: consider a hydrogen-like atom: a nucleus with charge $+Z$ and a single
electron around it. (For example, the case $Z = 2$ corresponds to He$^+$, and $Z = 1$ is
just the hydrogen atom.) The potential is then Z times bigger. Skim through the
computation of this chapter, and check which things change. Find the expression for
the wave function. Hint: the functions occurring remain the same but everything
(prefactors, function arguments) gets rescaled. If you do all well, you should in the
end find that the energy levels of such a system are given by $E_n = -E_0 \frac{Z^2}{n^2}$.

7. Consider a particle in a box - a true one, in three dimensions, with volume $L^3$. Write
down the Hamiltonian: what is the kinetic part, what is the potential $V(x,y,z)$?
Write down the time-independent Schrödinger equation. To simplify things, use
separation of variables: put

$$\psi(x,y,z) = \psi_1(x)\,\psi_2(y)\,\psi_3(z)$$

Show that the equation breaks apart in three separate equations. Do you recognise
these equations? What are the solutions? Write down the most general solution of
the 3D system, and show that the energy levels are of the form

$$\frac{\hbar^2\pi^2}{2mL^2}\left(n_1^2 + n_2^2 + n_3^2\right)$$

with $n_1$, $n_2$ and $n_3$ integers.



Chapter 6 

The commutator 



In this chapter... 

We will come back to the result we got for the hydrogen atom. We address some small 
issues, giving another view on the situation which (hopefully) extends your understanding. 
A technical tool that will be important for this discussion is the commutator. This concept 
has come up already several times, and it will re-appear even more often, so that is why 
it deserved this chapter's title. We start by improving our commutator computing skills, 
then (as promised) we spend some more time pondering about the result of the previous 
chapter, and we conclude with several small illustrative computations.

6.1 Commute along, cowboy



In the chapter on the measurement, we introduced the commutator of two operators. It 
is defined as 

[A,B]=AB-BA. (6.1) 

If A and B were ordinary numbers, the above would obviously be zero. For operators
however, the order in which you let them act on a function might matter; in that case the
commutator is nonzero. In this section we briefly give some facts on commutators.
Most of them are easy to check, just by writing out everything explicitly; it is a good
exercise to do so. First of all, taking the commutator is a linear operation:

[aA + bB,C] = a[A,C] + b[B,C] (6.2) 

for complex numbers a, b, and operators A, B, C. Similarly 

[A,bB + cC] = b[A,B] + c[A,C] (6.3) 

The commutator of products is given by the 'drag out sideways' rule:

$$[A, BC] = [A,B]C + B[A,C] \quad\text{and}\quad [AB, C] = A[B,C] + [A,C]B. \tag{6.4}$$

This can again be seen by just writing out both sides - carefully retaining the order of
operators within each term. We have also seen that

$$[X, D_x] = -1 \tag{6.5}$$

We showed this property before, but just to refresh your memory, here is a concrete mini-example
illustrating it (for, say, the function $x^2$):

$$[X, D_x]\,x^2 = X D_x x^2 - D_x X x^2 = X(2x) - D_x(x^3) = 2x^2 - 3x^2 = -1\,(x^2). \tag{6.6}$$
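
The same mini-example can be repeated for any polynomial. The sketch below is an illustration with made-up helper names, not something taken from the course: a polynomial is stored as a list of coefficients $[c_0, c_1, c_2, \ldots]$, X multiplies by $x$, and D differentiates. Applying $XD - DX$ should then return minus the polynomial you started with.

```python
def X(p):
    """Multiply the polynomial by x: every coefficient moves up one degree."""
    return [0] + list(p)


def D(p):
    """Differentiate the polynomial with respect to x."""
    return [k * c for k, c in enumerate(p)][1:] or [0]


def subtract(p, q):
    """Coefficient-wise difference p - q, padding the shorter list with zeros."""
    size = max(len(p), len(q))
    p = list(p) + [0] * (size - len(p))
    q = list(q) + [0] * (size - len(q))
    return [a - b for a, b in zip(p, q)]


def commutator_X_D(p):
    """Action of [X, D] = XD - DX on the polynomial p."""
    return subtract(X(D(p)), D(X(p)))


if __name__ == "__main__":
    print(commutator_X_D([0, 0, 1]))  # x^2, the example of (6.6)
    print(commutator_X_D([1, 2, 3]))  # 1 + 2x + 3x^2
```

In both cases every coefficient comes back with the opposite sign, which is the statement $[X, D_x] = -1$.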

Of course the same holds for other variables: $[Y, D_y] = [Z, D_z] = -1$. As a consequence
(using $P_i = -i\hbar D_i$)

$$[X, P_x] = [Y, P_y] = [Z, P_z] = i\hbar \tag{6.7}$$

The above is sometimes called the canonical commutation relation. On the other
hand, mixed position-momentum commutators are zero:

$$[X, P_y] = 0 \tag{6.8}$$

and similar for any other combination of two different variables. To see this, just note
that for any function $f(x,y,z)$,

$$X D_y f = X(\partial_y f) = x\,\partial_y f = \partial_y(x f) = D_y X f. \tag{6.9}$$

If the second-to-last step is not immediately clear, just use the product rule for
differentiation to see that

$$\partial_y(x f) = (\partial_y x) f + x(\partial_y f) = 0 + x\,\partial_y f. \tag{6.10}$$

We're done with the first part. Not all too harsh, right?

6.2 Asking those questions...

In the previous chapter, we uncovered the quantum structure of the hydrogen atom. You
may have felt a bit like a passive audience though: most results fell out of the sky. In
particular, the fact that we have to use three labels to specify each state seems to have
emerged 'miraculously'. Of course, you may be satisfied with this, but if you are critical,
there are several questions you can ask. First, notice that the third quantum number (m)
describes the z-component of the angular momentum. Why this one? What is special
about this direction? Didn't we start out with a perfectly spherically symmetric problem? 
Also, why did we find three quantum numbers, and not (say) four? How could we have 
seen this coming in advance? These are all quite good questions. Below, we will answer 
them in detail. Hopefully this adds some extra understanding to the matter. 

6.2.1 Z-supremacy? 

Why did we find the states of an electron in the hydrogen atom to be eigenstates of $L_z$,
and not $L_x$ or $L_y$? Here is the explanation. Recall that we found that given the total
angular momentum (labeled by $\ell$) there were several solutions to the angular equation.
So given some $\ell$, there are several solutions to

$$L^2 F(\theta, \varphi) = \hbar^2 \ell(\ell + 1) F(\theta, \varphi). \tag{6.11}$$

Now here is the key observation: the above equation is linear - just like the Schrödinger
equation. So if you have two solutions $F_1(\theta,\varphi)$ and $F_2(\theta,\varphi)$ then any linear combination
$aF_1(\theta,\varphi) + bF_2(\theta,\varphi)$ is a solution too. Thus, when we said there is more than one solution,
we should actually have said: there are infinitely many solutions, and they form a vector
space. This vector space can best be described by picking a basis - but which one you
choose is up to you. In the previous chapter we picked a basis, namely the functions $Y_\ell^m$,
which (on top of $L^2$) were also eigenfunctions of the operator $L_z$, with eigenvalues $m\hbar$. Of
course, it is possible to pick any other basis. In fact, it is equally possible to pick basis
elements that are eigenfunctions of $L^2$ and $L_x$ (or of $L^2$ and $L_y$). In such a case, you
would label your basis states by $\ell$ and their eigenvalue under $L_x$ (or $L_y$). So there really
is nothing special about the z-direction. Using $L_z$ just happens to provide the simplest
looking basis of the space of solutions of (6.11). That is why people like to use that one.
Originally, this comes from the fact that in the $(r, \theta, \varphi)$ system, $L_z$ has a very simple form,
$-i\hbar\partial_\varphi$. The other two angular momentum operators look a bit more difficult, and so do
their eigenfunctions - but nothing fundamental distinguishes them. As an illustration, we
will actually perform the change of basis explicitly in the computation part of the chapter.

Orthogonal states 

Here is another question that might have come to your mind. Are the states we found
an orthogonal basis? As emphasized in the chapter on observables, such a basis is very
valuable. We have also shown that eigenstates of an observable with different eigenvalues
are automatically orthogonal. Here, things look a bit different at first, since there are
several states with the same n. How does this work? First, let us be more clear on what
we mean by orthonormal in this situation. Denoting by $|n\ell m\rangle$ the state corresponding to
$\psi_{n\ell m}(x)$, we can express orthonormality by:

$$\langle n\ell m | n'\ell' m' \rangle = \delta_{nn'}\,\delta_{\ell\ell'}\,\delta_{mm'}. \tag{6.12}$$

Let us think about this. States with different n have a different energy. That assures that
$\langle\psi_{n\ell m}|\psi_{n'\ell'm'}\rangle = 0$ whenever $n \neq n'$. But what if $n = n'$? Ah, remember that $\ell$ labels the
eigenvalue under $L^2$. This is a Hermitian operator too, as it corresponds to the observable
'angular momentum'. Hence, states with different eigenvalues under $L^2$ (different $\ell$) are orthogonal
too. And in the same fashion, eigenstates with a different m have different eigenvalues
under the (also Hermitian) operator $L_z$, so that is sufficient to guarantee orthogonality
too. In conclusion, the inner product $\langle\psi_{n\ell m}|\psi_{n'\ell'm'}\rangle$ will be zero whenever at least one of the three
quantum numbers $n$, $\ell$, or $m$ is different. Stated differently: the above inner product is
only nonzero when taking the inner product of a state with itself. If one also makes sure
states are normalized ($\langle\psi_{n\ell m}|\psi_{n\ell m}\rangle = 1$) the equation (6.12) is indeed satisfied.

6.2.2 CoSCO 

Let's imagine what would happen if you were a computing beast which could solve differ- 
ential equations simply by looking at them. Then, studying the hydrogen atom quantum 
mechanically would go a bit like this. You write down the Hamiltonian for an electron 
bound to a nucleus. You glance at the time-independent Schrödinger equation (the
eigenfunction problem) and instantly solve it - without having to use silly tricks (splitting it
into a radial and angular equation) which normal humans have to rely on: you just write 
down an infinite tower of solutions (which span the Hilbert space) labelled by their energy, 
and that's it. 

But suddenly something bothers you: there are several states belonging to each energy 
level. The entire set of solutions with the same energy actually forms a vector space, since 
the sum of eigenfunctions with the same eigenvalue is again an eigenfunction. Such a 
vector space is called a subspace of the total Hilbert space. This is not a problem per 
se, but you'd like to find a decent, orthogonal basis of the Hilbert space - and picking a 
random basis of energy eigenstates -like you just did- is no good: states with the same 
energy eigenvalue are not necessarily orthogonal. 

So how do you find a basis that is orthogonal then? You glance again (still in beast
mode) and immediately realize that for every subspace, you can find a special set of
states which are eigenstates of both $L^2$ and $L_z$. Picking these special states as your basis
(elements of this basis are now labeled by their eigenvalue under $H$, $L^2$ and $L_z$) you have
now obtained an orthogonal basis! Indeed, any two elements of your basis have a different
eigenvalue with respect to at least one Hermitian operator ($H$ or $L^2$ or $L_z$) so they are
guaranteed to be orthogonal. This finally gives you an orthonormal basis $|n\ell m\rangle$. (Being
mere humans, we needed more work to obtain this basis.)

Since all states of the Hilbert space basis can be labeled by their eigenvalues under the
observables $H$, $L^2$ and $L_z$, these are called a complete set of commuting observables,
or CoSCO for short. Let's clarify. The term 'complete' means they specify every state
uniquely: no two different states can be found with the same eigenvalues. Next (as
suggested by the name) such a set of observables necessarily commutes. This is not too
hard to show. Suppose one has two observables A and B and a basis of states $|ij\rangle$, which
are eigenstates of A with eigenvalues $a_i$ and eigenstates of B with eigenvalues $b_j$. So

$$A|ij\rangle = a_i|ij\rangle \quad\text{and}\quad B|ij\rangle = b_j|ij\rangle \qquad \forall i, j \tag{6.13}$$

So i specifies the A-subspace to which a state belongs, and j the B-subspace. Then it
follows that for every state $|\psi\rangle = \sum_i\sum_j \alpha_{ij}|ij\rangle$ in the Hilbert space

$$\begin{aligned}
AB|\psi\rangle &= AB\sum_i\sum_j \alpha_{ij}|ij\rangle \\
&= A\sum_i\sum_j b_j\,\alpha_{ij}|ij\rangle \\
&= \sum_i\sum_j b_j a_i\,\alpha_{ij}|ij\rangle \\
&= B\sum_i\sum_j a_i\,\alpha_{ij}|ij\rangle \\
&= BA\sum_i\sum_j \alpha_{ij}|ij\rangle = BA|\psi\rangle
\end{aligned} \tag{6.14}$$

At several points the linearity of A and B was used. (This is OK because all operators in
quantum mechanics are linear.) So $AB|\psi\rangle = BA|\psi\rangle$ for all states $|\psi\rangle$ in the Hilbert space,
meaning

$$AB = BA \quad\text{so}\quad [A, B] = 0 \tag{6.15}$$

Indeed, A and B must commute. This explains the second C in CoSCO.
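
The argument of (6.14) can be watched at work in a small numerical experiment. The sketch below is only an illustration: the orthonormal basis and the eigenvalues are arbitrary choices made for this example, and the helper names are hypothetical. Two operators are built so that they share a basis of eigenvectors, with A having eigenvalues (1, 1, 2) and B having eigenvalues (1, -1, 1), and then the commutator is computed.

```python
import math

S = math.sqrt(2.0)

# An arbitrary orthonormal basis of three real vectors (chosen for illustration).
BASIS = [
    [0.5, -S / 2, 0.5],
    [1 / S, 0.0, -1 / S],
    [0.5, S / 2, 0.5],
]


def operator_from_eigenvalues(eigenvalues, basis):
    """Build sum_k value_k |u_k><u_k| for an orthonormal (real) basis u_k."""
    size = len(basis)
    op = [[0.0] * size for _ in range(size)]
    for value, u in zip(eigenvalues, basis):
        for r in range(size):
            for c in range(size):
                op[r][c] += value * u[r] * u[c]
    return op


def matmul(A, B):
    size = len(A)
    return [[sum(A[r][k] * B[k][c] for k in range(size)) for c in range(size)]
            for r in range(size)]


def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    size = len(A)
    return [[AB[r][c] - BA[r][c] for c in range(size)] for r in range(size)]


A = operator_from_eigenvalues([1, 1, 2], BASIS)
B = operator_from_eigenvalues([1, -1, 1], BASIS)

if __name__ == "__main__":
    # Every entry should vanish up to rounding errors.
    for row in commutator(A, B):
        print(row)
```

Note that neither set of eigenvalues alone distinguishes the three basis vectors, but the pairs (1, 1), (1, -1) and (2, 1) do. That is the 'complete' part of a CoSCO.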

6.2.3 Compatible observables 

The above combined with our finding of the states $|n\ell m\rangle$ implies that all the commutators
between $H$, $L^2$ and $L_z$ have to be zero:

$$[H, L^2] = [H, L_z] = [L^2, L_z] = 0. \tag{6.16}$$

But also a converse of the previous section is true: if the commutator $[A, B]$ of two
observables is zero, then it is possible to find states that are eigenstates of A and B at the
same time. Because of this property, commuting observables A and B are sometimes called
compatible observables. But not every pair of observables is compatible. For example, we
will see in the computation part that $[L_y, L_z]$ and $[L_x, L_z]$ are not zero. This means that
it is not possible to find a basis of states that are simultaneous eigenstates of more than
one component of the angular momentum. So generally, if a state is an eigenstate of one
of the components of the angular momentum, it cannot be an eigenstate of either of the two
other components. With respect to one of these two other components, it must then be
a superposition of several eigenstates, and measuring one of these other components will
thus always yield a probabilistic outcome. This has very important consequences: in fact,
it is the underlying reason for the so-called 'uncertainty principle', which we will discuss in
Chapter 8. Anyhow, we have now gotten the answer to the last question we posed at the
beginning of this section. We now know that there are only 3 quantum numbers necessary
(and not, say, 4) to describe states of the hydrogen atom because there are precisely three
compatible observables for that system, no more and no less. There is still some freedom in which
set you take (for example $(H, L^2, L_z)$ or $(H, L^2, L_x)$ or ...), but you always need three of
them.



6.3 Filling in the gaps



In this section we fill in three gaps that we left open in the story part. First, we explic- 
itly perform the basis change that we were talking about in the 'z-supremacy'-section. 
Next, we give a baby-example of another problem where a CoSCO shows up. Finally, we 
compute the commutation relations between different components of the angular momen- 
tum to show that they are non-zero. This proves the different components of the angular 
momentum are non-compatible observables. 

6.3.1 Gap 1: Changing basis 

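Let us denote the states $|2,1,-1\rangle$, $|2,1,0\rangle$ and $|2,1,1\rangle$ by the vectors

$$\begin{pmatrix}1\\0\\0\end{pmatrix}, \quad \begin{pmatrix}0\\1\\0\end{pmatrix} \quad\text{and}\quad \begin{pmatrix}0\\0\\1\end{pmatrix} \tag{6.17}$$

So the vector

$$\begin{pmatrix}a\\b\\c\end{pmatrix} \tag{6.18}$$

is a shorthand for the state $a|2,1,-1\rangle + b|2,1,0\rangle + c|2,1,1\rangle$. The advantage of this
symbolic notation is the following. Operators acting on these three states can be represented
by a matrix acting on the corresponding vector. Indeed, the action of $L_z$ for example can
be written as

$$L_z = \hbar\begin{pmatrix}-1&0&0\\0&0&0\\0&0&1\end{pmatrix} \tag{6.19}$$

So for example the equation $L_z|2,1,-1\rangle = (-\hbar)|2,1,-1\rangle$ is represented in matrix form by

$$\hbar\begin{pmatrix}-1&0&0\\0&0&0\\0&0&1\end{pmatrix}\begin{pmatrix}1\\0\\0\end{pmatrix} = (-\hbar)\begin{pmatrix}1\\0\\0\end{pmatrix} \tag{6.20}$$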

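Similarly, the operator $L^2$ acts on these states as

$$L^2 = 2\hbar^2\begin{pmatrix}1&0&0\\0&1&0\\0&0&1\end{pmatrix} \tag{6.21}$$

because all three states are eigenstates of $L^2$ with eigenvalue $\hbar^2\ell(\ell+1) = 2\hbar^2$. Using the
spherical coordinate expression for $L_x$:

$$L_x = i\hbar\left(\sin\varphi\,\partial_\theta + \cot\theta\cos\varphi\,\partial_\varphi\right) \tag{6.22}$$

and its explicit action on the three states under consideration, one can check that $L_x$ acts
as the matrix

$$L_x = \frac{\hbar}{\sqrt{2}}\begin{pmatrix}0&1&0\\1&0&1\\0&1&0\end{pmatrix} \tag{6.23}$$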

Let us find the eigenvalues and eigenvectors of this matrix. First of all, the determinant
of $L_x - \lambda\mathbb{1}$ is $(-\lambda^3 + \hbar^2\lambda)$, which vanishes for $\lambda = -\hbar$, $\lambda = 0$ and $\lambda = \hbar$. This means
the eigenvalues of the matrix $L_x$ are $-\hbar$, $0$ and $\hbar$, just as for $L_z$. On the contrary, the
eigenvectors are different. The eigenvector corresponding to $\lambda = -\hbar$ can be found by
demanding

$$\hbar\begin{pmatrix}1&\frac{1}{\sqrt{2}}&0\\ \frac{1}{\sqrt{2}}&1&\frac{1}{\sqrt{2}}\\ 0&\frac{1}{\sqrt{2}}&1\end{pmatrix}\begin{pmatrix}a\\b\\c\end{pmatrix} = 0 \tag{6.24}$$

which is solved by

$$v_1 = \begin{pmatrix}1\\-\sqrt{2}\\1\end{pmatrix} \tag{6.25}$$

or any multiple of that vector. Similarly, one can find the eigenvectors corresponding to
the other two eigenvalues. They are

$$v_2 = \begin{pmatrix}1\\0\\-1\end{pmatrix} \quad\text{and}\quad v_3 = \begin{pmatrix}1\\\sqrt{2}\\1\end{pmatrix} \tag{6.26}$$

We are now very close to our original goal. Remember that we set out to find a basis of
states that are eigenstates of the operator $L_x$ instead of $L_z$. By our above calculation, we
have actually done so, in a very economic fashion. Indeed, the states corresponding to the
vectors $v_1$, $v_2$ and $v_3$ are precisely eigenstates of $L_x$, with eigenvalues $-\hbar$, $0$ and $\hbar$. For
convenience, we will denote these states by $|\psi_1\rangle$, $|\psi_2\rangle$ and $|\psi_3\rangle$. For example: $v_1$
corresponds to the state $|\psi_1\rangle = 1|2,1,-1\rangle - \sqrt{2}|2,1,0\rangle + |2,1,1\rangle$, and one can check (although
that is a bit tedious) that the corresponding wave function indeed is an eigenstate of $L_x$
with eigenvalue $-\hbar$. If we would switch to a notation where the vectors

$$\begin{pmatrix}1\\0\\0\end{pmatrix}, \quad \begin{pmatrix}0\\1\\0\end{pmatrix} \quad\text{and}\quad \begin{pmatrix}0\\0\\1\end{pmatrix} \tag{6.27}$$

would represent the states $|\psi_1\rangle$, $|\psi_2\rangle$ and $|\psi_3\rangle$, then $L_x$ would have precisely the form $L_z$
had before:

$$L_x = \hbar\begin{pmatrix}-1&0&0\\0&0&0\\0&0&1\end{pmatrix}. \tag{6.28}$$

This confirms that one may just as well work with $L_x$ eigenstates by a simple change of
basis. We can easily make this basis orthonormal by getting the normalization right. We
illustrate this with $|\psi_1\rangle$: redefine

$$|\psi_1\rangle = N\left(1|2,1,-1\rangle - \sqrt{2}|2,1,0\rangle + |2,1,1\rangle\right) \tag{6.29}$$

with N a suitable constant. Demanding $\langle\psi_1|\psi_1\rangle = 1$ gives

$$\begin{aligned}
1 &= \left(\langle 2,1,-1| - \sqrt{2}\langle 2,1,0| + \langle 2,1,1|\right)N^* \cdot N\left(1|2,1,-1\rangle - \sqrt{2}|2,1,0\rangle + |2,1,1\rangle\right) \\
&= |N|^2\,(1 + 2 + 1) \\
\Rightarrow\quad |N| &= \frac{1}{2}
\end{aligned}$$

So we could take for example $N = \frac{1}{2}$, or $N = \frac{1}{2}e^{i\theta}$ with any value of $\theta$ - as we have seen before,
these are all considered the same physical state anyway. The very same thing
can be done to normalize the other two vectors.

Note: in the above example we have only changed basis from $L_z$ to $L_x$ eigenstates for
the n = 2 level; the general principle is hopefully clear.
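
If you do not feel like doing the matrix algebra by hand, the eigenvectors and the normalization can be checked with a few lines of code. This is only a sketch in plain Python with illustrative helper names; it works in units where $\hbar = 1$ and uses the matrix (6.23) together with the three eigenvectors quoted in (6.25) and (6.26).

```python
import math

ROOT2 = math.sqrt(2.0)

# The matrix (6.23) in units where hbar = 1.
LX = [
    [0.0, 1 / ROOT2, 0.0],
    [1 / ROOT2, 0.0, 1 / ROOT2],
    [0.0, 1 / ROOT2, 0.0],
]

# Eigenvalue -> candidate eigenvector, as quoted in (6.25) and (6.26).
CANDIDATES = {
    -1: [1.0, -ROOT2, 1.0],
    0: [1.0, 0.0, -1.0],
    1: [1.0, ROOT2, 1.0],
}


def apply(matrix, vector):
    """Matrix times column vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def is_eigenvector(matrix, vector, eigenvalue, tol=1e-12):
    image = apply(matrix, vector)
    return all(abs(i - eigenvalue * v) < tol for i, v in zip(image, vector))


def norm(vector):
    return math.sqrt(sum(v * v for v in vector))


def normalized(vector):
    length = norm(vector)
    return [v / length for v in vector]


if __name__ == "__main__":
    for eigenvalue, vector in CANDIDATES.items():
        print(eigenvalue, is_eigenvector(LX, vector, eigenvalue), normalized(vector))
    # Normalization constant N for v1; compare with |N| = 1/2 found above.
    print(1 / norm(CANDIDATES[-1]))
```

The three vectors are also mutually orthogonal, as they should be for eigenvectors of a Hermitian matrix with different eigenvalues.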

6.3.2 Gap 2: Another CoSCO 

Here, we do a baby example involving the use of a CoSCO. Consider the operator $D_x^2$. Its
eigenfunctions with eigenvalue $-1$ are given by $e^{ix}$ and $e^{-ix}$. Since these are two linearly
independent functions, they span an eigenspace of dimension two. More generally, for each
eigenvalue $-a^2$ there are two eigenfunctions $e^{iax}$ and $e^{-iax}$. This means that the label $a$
is not enough to specify which function we are talking about. Can we find a (Hermitian)
operator that commutes with $D_x^2$? Yes, we can define the reflection operator R as

$$R : f(x) \mapsto Rf(x) = f(-x). \tag{6.30}$$

So R maps a function to the function obtained by flipping around the vertical axis:

[Figure: a function $f(x)$ and its mirror image $Rf(x) = f(-x)$.]

One can check that this commutes with our other operator:

$$[R, D_x^2] = 0. \tag{6.31}$$

Here comes the nice thing: the eigenvalues of R are 1 and $-1$, and they split the degenerate
eigenspaces of $D_x^2$ cleanly into two. Explicitly:

$$R\left(e^{iax} \pm e^{-iax}\right) = e^{-iax} \pm e^{iax} = \pm\left(e^{iax} \pm e^{-iax}\right) \tag{6.32}$$

So the functions $e^{-iax} + e^{iax}$ and $e^{-iax} - e^{iax}$ provide a very nice basis: they are simultaneous
eigenfunctions of $D_x^2$ and $R$, uniquely labeled by some $a$ and $\pm 1$. That's all. Of course, the
example has some shortcomings. We did not specify the total Hilbert space that we wanted
to find a basis for. Also, the basis elements are a bit special - they are not normalizable.
This is somewhat unusual, but similar situations will be dealt with in the next chapter.
The only point however was to show the CoSCO mechanism of using multiple labels to
describe states uniquely. The obtained basis then diagonalizes all the operators of the
CoSCO. By 'diagonalizing', we just mean 'to form/pick a basis of eigenfunctions' - just like
in linear algebra.
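
Here is a small numerical illustration of this baby example. It is a sketch under stated assumptions: functions are represented as Python callables, the second derivative is approximated by a finite difference with a hand-picked step size, and all names are made up for this illustration. It checks at a few sample points that the two combinations are eigenfunctions of R with eigenvalues $+1$ and $-1$, and of $D_x^2$ with eigenvalue $-a^2$.

```python
import cmath


def even_mode(a):
    """The combination e^{iax} + e^{-iax}."""
    return lambda x: cmath.exp(1j * a * x) + cmath.exp(-1j * a * x)


def odd_mode(a):
    """The combination e^{iax} - e^{-iax}."""
    return lambda x: cmath.exp(1j * a * x) - cmath.exp(-1j * a * x)


def reflect(f):
    """The reflection operator R: (Rf)(x) = f(-x)."""
    return lambda x: f(-x)


def second_derivative(f, h=1e-3):
    """Central finite-difference approximation of the second derivative."""
    return lambda x: (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)


if __name__ == "__main__":
    a = 1.5
    for x in (0.3, -1.1, 2.0):
        plus, minus = even_mode(a), odd_mode(a)
        # R eigenvalue checks: both differences should vanish.
        print(abs(reflect(plus)(x) - plus(x)), abs(reflect(minus)(x) + minus(x)))
        # D^2 eigenvalue checks: both should be close to zero.
        print(abs(second_derivative(plus)(x) + a * a * plus(x)),
              abs(second_derivative(minus)(x) + a * a * minus(x)))
```

The finite-difference check is only approximate, so the last two numbers are small but not exactly zero.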

6.3.3 Gap 3: The angular momentum commutation relations 

In the story part, we mentioned that the different components of the angular momentum
do not commute, so that it is not possible to find simultaneous eigenstates. Here, we
explicitly find these commutators. This also puts many of the formulae of the first section
into practice. Let us start with $[L_x, L_y]$:

$$[L_x, L_y] = [YP_z - ZP_y,\ ZP_x - XP_z] \tag{6.33}$$

Using linearity, one gets four commutators. One of them for example is $[YP_z, ZP_x]$. Now
using the 'drag out sideways' rule from above, we can rewrite that commutator as a
sum of four terms:

$$\begin{aligned}
[YP_z, ZP_x] &= Z[YP_z, P_x] + [YP_z, Z]P_x \\
&= ZY[P_z, P_x] + Z[Y, P_x]P_z + Y[P_z, Z]P_x + [Y, Z]P_zP_x
\end{aligned} \tag{6.34}$$

Phooh, this smells nasty. Luckily, the last expression is not as horrible as it looks. The
only term involving a nonzero commutator is the $Y[P_z, Z]P_x$ term. So the entire above
expression just equals $-i\hbar YP_x$. From this, we can conclude the following trick. Just look
whether a position and a momentum operator of the same coordinate occur in a
commutator. Such pairs are the only possible reason for a commutator to be nonzero.
Another term of (6.33) is $[ZP_y, XP_z]$. Here, the only problem arises from the z-coordinate.
Pulling out the two other operators, we get

$$[ZP_y, XP_z] = P_yX[Z, P_z] = i\hbar XP_y \tag{6.35}$$

The two remaining terms of (6.33) have no possible problem: only mixed commutators,
or position-position or momentum-momentum. So they are zero. In conclusion:

$$[L_x, L_y] = i\hbar XP_y - i\hbar YP_x = i\hbar L_z \tag{6.36}$$
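
The result (6.36) and its cyclic partners can be cross-checked on the three n = 2, $\ell = 1$ states, where the angular momentum components become 3 x 3 matrices. The sketch below works in units where $\hbar = 1$ and in the basis ordered as $m = -1, 0, 1$. LX is the matrix (6.23) and LZ is diagonal with entries $-1, 0, 1$. The matrix LY is not given in the text: it is the standard $\ell = 1$ matrix written in this ordering, and should be read as an assumption of this illustration, as should the helper names.

```python
import math

R = 1 / math.sqrt(2.0)

# l = 1 matrices in units hbar = 1, basis ordered as m = -1, 0, +1.
LX = [[0, R, 0], [R, 0, R], [0, R, 0]]
LY = [[0, 1j * R, 0], [-1j * R, 0, 1j * R], [0, -1j * R, 0]]  # assumed, see lead-in
LZ = [[-1, 0, 0], [0, 0, 0], [0, 0, 1]]


def matmul(A, B):
    size = len(A)
    return [[sum(A[r][k] * B[k][c] for k in range(size)) for c in range(size)]
            for r in range(size)]


def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def scale(factor, A):
    return [[factor * a for a in row] for row in A]


def commutator(A, B):
    return add(matmul(A, B), scale(-1, matmul(B, A)))


def close(A, B, tol=1e-12):
    return all(abs(a - b) < tol for ra, rb in zip(A, B) for a, b in zip(ra, rb))


if __name__ == "__main__":
    print(close(commutator(LX, LY), scale(1j, LZ)))
    print(close(commutator(LY, LZ), scale(1j, LX)))
    print(close(commutator(LZ, LX), scale(1j, LY)))
    # The 'drag out sideways' rule [A, BC] = [A, B]C + B[A, C] for these matrices.
    lhs = commutator(LX, matmul(LY, LZ))
    rhs = add(matmul(commutator(LX, LY), LZ), matmul(LY, commutator(LX, LZ)))
    print(close(lhs, rhs))
```

A check on one small subspace is of course no proof, but it is a quick way to catch sign mistakes in a computation like the one above.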

Analogously, one can find the two other commutators. Writing all results together, we 
have: 

$$[L_x, L_y] = i\hbar L_z, \qquad [L_y, L_z] = i\hbar L_x, \qquad [L_z, L_x] = i\hbar L_y$$

The above relations are called the angular momentum commutation relations. They
show that the different components of the angular momentum aren't compatible
observables. Computing them used everything we learned about commutators, so if you have
reached/understood this page, you can rightfully call yourself a true commander of
commutator craft. Good job!

Exercises

1. Explain in detail to someone (or yourself) what a CoSCO is. Can you reconstruct 
the reasoning? 

2. Use the above angular momentum commutation relations and the 'drag out sideways'
rule to show that $[L^2, L_z] = 0$.

3. Compute $[f(X), P]$ where $f$ is a function.

4. Show the Baker-Campbell-Hausdorff formula, which states that
$e^{A} B e^{-A} = B + [A,B] + \frac{1}{2!}[A,[A,B]] + \frac{1}{3!}[A,[A,[A,B]]] + \ldots$. (Hint 1: the exponential of an
operator is defined by the series expansion. Hint 2: The expression on the left can
be seen as the function $F(s) = e^{sA} B e^{-sA}$, evaluated at $s = 1$. The value there can
also be obtained by series expanding $F(s)$ around 0. What relation is there between
two consecutive coefficients of the series expansion?)

5. Find the states with n = 2 which are eigenstates of $L_y$.

6. Make up two operators and compute their commutator.

7. It was stated that two observables commute if and only if they can be simultaneously 
diagonalized (=admit a basis of states which are all eigenstates of both operators). 
What is the linear algebra analog of this statement? Can you prove this? 



Chapter 7 

Position and momentum basis 



In this chapter... 

In the previous chapters, we have learned that a particle is not a point but an object 
smeared out over some region in space. This means the position is not sharply defined. In 
this chapter we will think a bit more carefully about the momentum of a particle. We will
see that (just like position) the momentum of a particle is not sharply defined. In fact, we
will see that a particle can be thought of as a packet spread out over different momenta,
in the very same way that it is a packet spread out over different positions. An important
mathematical tool will be the notion of the Fourier transform. This will allow us to switch
back and forth between the two descriptions, and give some more feel for the nature of
the wave function.

7.1 Fourier's trick



As mentioned in the introduction, we will need a mathematical tool called the Fourier
transform. We introduce this tool here. If you have not learned about this before, be sure to
read the following carefully. It is an important technique for physics and mathematics,
and understanding it well will surely benefit you later on. If you have seen Fourier
transforms before, just see the following section as a short recap. We work in two steps: first
we study the baby brother of the Fourier transform: the Fourier series. After that, we are in a
better position to understand the (closely related) topic of Fourier transforms.

7.1.1 Fourier series 

Imagine you have a function that is periodic, that is: it consists of a single piece, which is 
repeated over and over. For instance, it could look like this: 




The length (in the x-direction) of the repeated piece is the period of the function.^1
For the function above, the period T has been indicated in two places. Suppose that the
period of a function is equal to $2\pi$. In that case, you can capture all the information of
the function effectively by giving its value on the interval $[-\pi, \pi]$. Indeed, the value of the
function anywhere outside the interval (say at some $x + 2\pi k$, with $x \in [-\pi, +\pi]$ and $k$ an
integer) is just given by

$$f(x + 2\pi k) = f(x). \tag{7.1}$$

You know that there is a very important pair of functions that are of this
kind (periodic with period $2\pi$): the trigonometric functions $\sin x$ and $\cos x$. In fact, all
functions $\sin(nx)$ and $\cos(nx)$ (with n a positive integer) are periodic with period $2\pi$.^2
The central question that lies at the heart of Fourier series is the following:

Can we write the periodic function $f$ as a linear combination of the functions
$\sin(nx)$ and $\cos(nx)$?



^1 For traveling waves, the period refers to the duration of the time cycle, and the wavelength refers to
the periodic behavior in the spatial direction. Here, as we are considering a function of one variable only,
you may call the length of a cycle either wavelength or period - the second one being slightly more popular.

^2 Actually, the functions $\sin(nx)$ and $\cos(nx)$ have period $2\pi/n$, but this implies they are also periodic
over a distance of $2\pi$ - as this is just a multiple of their period.

This question essentially asks how (and if) we can decompose an arbitrary periodic function
into harmonic functions (sines and cosines). Because harmonic functions have some really
nice properties, such a decomposition is obviously an interesting trick. It turns out that
for almost every well-behaved periodic function, this is indeed possible. An illustration
of how this works is shown in Figure 7.1. Let us try to see how this works in general.
With some good faith, suppose such a decomposition is indeed possible. So assume that
for every well-behaved $2\pi$-periodic function $f$ there exist numbers $a_n$ and $b_n$ (with $n \geq 1$)
and an $a_0$ such that

$$f(x) = \frac{a_0}{2} + \sum_{n=1}^{\infty}\left[a_n\cos(nx) + b_n\sin(nx)\right] \tag{7.2}$$

We have also included a constant function (first term); the last part is a sum over infinitely
many harmonic functions. For this infinite sum to be well defined, we need the $a_n$ and $b_n$
to go to zero fast enough to ensure convergence. This is a mathematical problem which
we will not touch upon here. A more important question one immediately comes up with
is: given that this decomposition is possible, how do I find the values of the $a_n$ and $b_n$?
Here is a crucial observation: you can show that for all positive integers n and m

$$\int_{-\pi}^{\pi}\sin(nx)\cos(mx)\,dx = 0 \tag{7.3}$$

$$\int_{-\pi}^{\pi}\sin(nx)\sin(mx)\,dx = \pi\,\delta_{nm} \tag{7.4}$$

$$\int_{-\pi}^{\pi}\cos(nx)\cos(mx)\,dx = \pi\,\delta_{nm} \tag{7.5}$$

where $\delta_{nm}$ is again the Kronecker delta:

$$\delta_{nm} = \begin{cases}1 & \text{if } n = m\\ 0 & \text{if } n \neq m\end{cases} \tag{7.6}$$
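
The relations (7.3) to (7.5) can be made plausible with a quick numerical integration. The sketch below is an illustration only: the midpoint rule, the number of steps and the function names are choices made for this example.

```python
import math


def integrate(g, lower=-math.pi, upper=math.pi, steps=2000):
    """Composite midpoint rule for the integral of g over [lower, upper]."""
    width = (upper - lower) / steps
    return width * sum(g(lower + (i + 0.5) * width) for i in range(steps))


def sin_cos(n, m):
    return integrate(lambda x: math.sin(n * x) * math.cos(m * x))


def sin_sin(n, m):
    return integrate(lambda x: math.sin(n * x) * math.sin(m * x))


def cos_cos(n, m):
    return integrate(lambda x: math.cos(n * x) * math.cos(m * x))


if __name__ == "__main__":
    for n in range(1, 4):
        for m in range(1, 4):
            # Columns: n, m, then the three integrals of (7.3), (7.4), (7.5).
            print(n, m, round(sin_cos(n, m), 9), round(sin_sin(n, m), 9),
                  round(cos_cos(n, m), 9))
```

Only the rows with n = m show a nonzero value (equal to $\pi$) in the last two columns.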

In words: if one integrates a product of two different harmonic functions ($n \neq m$, or sine
times cosine) the result will always be zero. Aha! Here is the miracle: if you multiply
(7.2) by $\cos kx$ and integrate x over $[-\pi, \pi]$, you get:

$$a_k = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\cos(kx)\,dx \qquad \forall k \geq 0 \tag{7.7}$$

This expression also incorporates the expression for the constant term $a_0$. In a similar
fashion you can show that

$$b_k = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\sin(kx)\,dx \qquad \forall k \geq 1 \tag{7.8}$$

In conclusion: by performing simple integrals of the function $f$, we can find the constants
$a_n$ and $b_n$. This gives a straightforward way to decompose any (well-behaved) periodic
function into harmonics. Nice trick, Mr. Fourier!

Figure 7.1: A Fourier series decomposes an arbitrary periodic function into harmonics.
The left side shows some of the first harmonic functions involved in Fourier series. The
right shows a function with the shape of a sawtooth. The line $f_1 = \sin x$ includes only one
term of the Fourier series. The function $f_5$ is a sum of more harmonic functions. Including
even more terms, you can approximate the sawtooth with arbitrary precision.
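
To make the recipe (7.7)-(7.8) concrete, here is a minimal numerical sketch (our own illustration, not part of the course material). It assumes the sawtooth $f(x) = x/2$ on $(-\pi, \pi)$, chosen so that the first term of its series is $\sin x$, and it approximates the integrals with a simple midpoint rule. All function names are hypothetical.

```python
import math

def fourier_coefficients(f, n_max, samples=4000):
    """Approximate a_k (k = 0..n_max) and b_k (k = 1..n_max) from (7.7) and (7.8)
    using the midpoint rule on [-pi, pi]. b[0] is a placeholder and always 0."""
    dx = 2 * math.pi / samples
    xs = [-math.pi + (j + 0.5) * dx for j in range(samples)]
    a = [sum(f(x) * math.cos(k * x) for x in xs) * dx / math.pi
         for k in range(n_max + 1)]
    b = [0.0] + [sum(f(x) * math.sin(k * x) for x in xs) * dx / math.pi
                 for k in range(1, n_max + 1)]
    return a, b

def partial_sum(a, b, x):
    """Evaluate the truncated series (7.2) at the point x."""
    return a[0] / 2 + sum(a[k] * math.cos(k * x) + b[k] * math.sin(k * x)
                          for k in range(1, len(a)))

def sawtooth(x):
    """Assumed example function: f(x) = x/2 on (-pi, pi)."""
    return x / 2

if __name__ == "__main__":
    a, b = fourier_coefficients(sawtooth, 5)
    for k in range(1, 6):
        print(k, round(a[k], 6), round(b[k], 6))
    print("f_5(1.0) =", partial_sum(a, b, 1.0), " exact f(1.0) =", sawtooth(1.0))
```

For this sawtooth the coefficients can also be found by hand: all $a_k$ vanish and $b_k = (-1)^{k+1}/k$, which is what the numerical values approach.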



Short story 

You might wonder about the possible uses of such a decomposition. If you ever take a
course on differential equations, you are quite likely to learn the above trick, and see how
it helps in solving problems like heat evolution, the evolution of waves, etcetera. Other
nice applications are voice recognition and synthesizers. So either your favorite action
movie scene (involving some fancy voice technology to open a safe) or your favorite music
group might very well rely on Fourier transforms. It works as follows: every tone (of a voice or an
instrument) typically has a frequency associated to it. However, a sound is seldom 'pure',
but consists of several superposed frequencies. These extra frequencies are sometimes
called overtones. Their relative amplitudes make the sound what it is: they give it the
typical total sound called timbre. It is precisely this that allows us to distinguish between
different instruments and different voices: not just their pitch, but the combination of
different frequencies. Here, Fourier analysis comes into play. Imagine someone gives you a
single sound signal (an amplitude as a function of time), and asks you to analyze it. This
is not an easy task. However, if you use Fourier analysis (for example with some software that does
the integrals/decomposition for you) you get a very nice overview of the frequencies that
make up the signal. This allows you to summarize the signal very economically, as if you
got its fingerprint. This way, a synthesizer can store large numbers of different sounds,
not relying on a sound database, but merely keeping some characteristic numbers for each
one of them. For speaker recognition, the idea is similar: the composition of someone's
voice can be analyzed and characterizes it almost uniquely, allowing for a
rather secure identification method, just like a fingerprint does. This gives just a slight hint
of the versatility of this technique.

7.1.2 Fourier transform

You may wonder: can we make the above trick useful in the context of wave functions too?
The answer is yes, but there are two important differences with the above. First, a wave
function is typically not a periodic function. Second, a wave function is a complex function
while the above only dealt with real functions. This means we will have to adapt the above
trick slightly, to a technique called Fourier transform. We start out with the remark
that the complex analogue of $\sin(kx)$ and $\cos(kx)$ is the function

$$e^{ikx} \quad \text{with } k \text{ real} \qquad (7.9)$$

So it is natural to go and try to decompose a complex function into several such $e^{ikx}$.
Since in the context of a wave function no specific periodicity is required, $k$ can take on
any real value. This suggests we need an integral (not just a sum) over different $k$'s. More
precisely, we want to write

$$f(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} dk\, g(k)\, e^{ikx} \qquad (7.10)$$

Here, $g(k)$ is a complex function, the Fourier transform of $f$, and has the same role as the
coefficients $a_n$ and $b_n$ before. The prefactor is just there for convenience. The function
$g(k)$ describes to what extent each of the waves $e^{ikx}$ is 'present' in the shape of $f$. So if
$f(x)$ has a piece that looks like a (complex) oscillation $e^{ik_0 x}$, then $g(k)$ will typically be
large around this $k_0$. Just like with a Fourier series, we can obtain $g$ by performing an
integral of $f$. It turns out that $g$ can be obtained as follows:

$$g(k) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} dx\, f(x)\, e^{-ikx} \qquad (7.11)$$



Notice how the last two expressions are nicely symmetric (up to the minus sign). For this
reason, expression (7.10), which rebuilds $f$ from $g$, is called the inverse Fourier transform.
Let us try to show how
the above definition of $g$ indeed is the correct one. In words, we want to show that the
above definition of $g$ substituted into (7.10) indeed gives $f$ as a result. If we plug in the
definition, we get

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} dk \left[ \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} dx'\, f(x')\, e^{-ikx'} \right] e^{ikx} \qquad (7.12)$$



and we want to show that this indeed equals $f(x)$. (We have changed the integration
variable to $x'$, since $x$ is already taken: it is the argument of the function $f$.)

Also, note that we have written the integration measure $dk$ before the integrand and not after it. In
case you have not met this before: it is a different notation but has an identical meaning. If you are
confused by it, just scratch away the $dk$ and write it at the end of the expression.

Proof that (7.12) = $f(x)$.

If we rearrange, and rewrite the range of integration as a limit, expression (7.12) becomes:

$$\frac{1}{2\pi} \int_{-\infty}^{\infty} dx'\, f(x') \lim_{K \to \infty} \int_{-K}^{K} dk\, e^{ik(x-x')} \qquad (7.13)$$



We have interchanged the integrals, which, in the context of physics, is no problem. (Mathematicians
will tell you that this may not always be correct but we're not going to be that
careful.) Now the integral on the right can be done explicitly (renaming $y = x - x'$):

$$\int_{-K}^{K} dk\, e^{iky} = \left[ \frac{e^{iky}}{iy} \right]_{k=-K}^{k=K} = 2\, \frac{\sin(Ky)}{y} \qquad (7.14)$$



The function on the right side (not including the factor 2) is called the sinc function. Its
shape is shown in Figure 7.2. There, you can see that the central peak is getting higher
and more narrow with increasing $K$. The limit for large $K$ occurring in (7.13) goes to ($\pi$
times) a very special function, the Dirac delta function (denoted $\delta$):

$$\lim_{K \to \infty} \frac{\sin(Ky)}{y} = \pi \delta(y) \qquad (7.15)$$

We will say more about this function in the next section, but we already give one of its
properties, namely:

$$\int_{-\infty}^{\infty} f(x)\, \delta(x)\, dx = f(0) \qquad (7.16)$$

for every function $f$. This is a peculiar property, but as promised, the next section will
explain this more carefully, so don't worry and just assume it is true for now. With this
information, we can rewrite (7.13) further. In terms of the new variable $y = x - x'$, the
right hand side becomes

$$\frac{1}{2\pi} \int_{-\infty}^{\infty} dy\, f(x-y) \lim_{K \to \infty} 2\, \frac{\sin(Ky)}{y} = \frac{1}{2\pi} \int_{-\infty}^{\infty} dy\, f(x-y)\, 2\pi \delta(y) \qquad (7.17)$$

Using the special property of the delta function, we can rewrite this last expression as

$$\frac{1}{2\pi} \int_{-\infty}^{\infty} dy\, f(x-y)\, 2\pi \delta(y) = f(x-y)\big|_{y=0} = f(x) \qquad (7.18)$$

Ah! This is precisely what we wanted. The above line shows that the definition of $g(k)$
is indeed the right one, and that it correctly gives the decomposition of $f(x)$ into a sum
(integral) of oscillating functions $e^{ikx}$. This decomposition will turn out to have some nice
applications in quantum mechanics, as we will see in the rest of this chapter. Before we
go there, some more words on this mysterious delta function.
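
The limit (7.15) can be made plausible numerically. The sketch below (our own illustration, not from the course) integrates $\sin(Ky)/y$ against the hypothetical test function $f(y) = e^{-y^2}$ and divides by $\pi$; according to (7.15) and (7.16) the result should approach $f(0) = 1$ as $K$ grows. For this particular $f$ the integral is known in closed form, $\pi\,\mathrm{erf}(K/2)$, which gives an independent check.

```python
import math

def smeared_value(f, K, half_width=8.0, samples=20000):
    """(1/pi) * integral of f(y) * sin(K*y)/y over [-half_width, half_width],
    computed with the midpoint rule. With an even number of samples the
    midpoints never hit y = 0, so no special case is needed there."""
    dy = 2 * half_width / samples
    total = 0.0
    for j in range(samples):
        y = -half_width + (j + 0.5) * dy
        total += f(y) * math.sin(K * y) / y
    return total * dy / math.pi

def gaussian(y):
    """Assumed test function with f(0) = 1."""
    return math.exp(-y * y)

if __name__ == "__main__":
    for K in (1.0, 4.0, 16.0):
        print(K, smeared_value(gaussian, K), math.erf(K / 2))
```

The printed values increase towards 1 with growing $K$, in line with the picture of an ever sharper central peak.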
Figure 7.2: The sinc function $y = \sin(Kx)/x$ plotted for several values of $K$.



7.1.3 Delta function 

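Consider the following sequence of functions $\chi_n$ (with $n$ a positive integer):

$$\chi_n(x) = \begin{cases} n & \text{if } -\frac{1}{2n} < x < \frac{1}{2n} \\ 0 & \text{everywhere else} \end{cases} \qquad (7.19)$$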



These functions have the following property: for large $n$ they are only nonzero within a
very narrow interval. Despite this narrowing, the area under each of these functions
is always 1, due to the fact that the height of the bump grows bigger and bigger (Figure
7.3). Explicitly:

$$\int_{-\infty}^{\infty} \chi_n(x)\, dx = 1 \qquad \forall n \qquad (7.20)$$

Figure 7.3: The functions $\chi_n$.

Now here is an alternative (but equivalent) definition of the Dirac delta function we met
above:

$$\delta(x) = \lim_{n \to \infty} \chi_n(x). \qquad (7.21)$$

Now if you like performing limits, you will immediately see that

$$\delta(x) = \begin{cases} \infty & \text{if } x = 0 \\ 0 & \text{everywhere else} \end{cases} \qquad (7.22)$$

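This is definitely not an ordinary function. In fact, the object $\delta$ is not a function, but a
distribution. A distribution is an object that does not make much sense on its own, only
when it is standing inside an integral. Indeed, although the above prescription looks a bit
strange, the integral of $\delta$ is perfectly fine:

$$\int_{-\infty}^{\infty} \delta(x)\, dx = \int_{-\infty}^{\infty} \lim_{n \to \infty} \chi_n(x)\, dx = \lim_{n \to \infty} \int_{-\infty}^{\infty} \chi_n(x)\, dx = \lim_{n \to \infty} 1 = 1 \qquad (7.23)$$

Also, using the fact that for a continuous function $f$ and small $\epsilon$

$$\int_{x-\epsilon}^{x+\epsilon} f(y)\, dy \approx 2\epsilon f(x) \qquad (7.24)$$

(with a relative error that vanishes as $\epsilon \to 0$) we get

$$\int_{-\infty}^{\infty} f(x)\, \delta(x)\, dx = \lim_{n \to \infty} \int_{-\infty}^{\infty} f(x)\, \chi_n(x)\, dx \qquad (7.25)$$

$$= \lim_{n \to \infty} n \int_{-1/2n}^{1/2n} f(x)\, dx \qquad (7.26)$$

$$= \lim_{n \to \infty} n \cdot \frac{1}{n}\, f(0) \qquad (7.27)$$

$$= f(0). \qquad (7.28)$$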

This result is precisely the property we used in the section about Fourier transforms! In words:
the central peak of $\delta(x)$ picks out the region around zero as the only contribution to the
integral, giving $f(0)$ as a result. From this property, one can derive that

$$\int_{-\infty}^{\infty} f(x)\, \delta(x-a)\, dx = f(a). \qquad (7.29)$$

To see this, just rewrite this last expression in terms of a new integration variable $y = x - a$.
Hopefully, you are not too confused by the peculiar character of this object. Just
keep in mind that the delta function is not an ordinary function, but a more general
object: a distribution. As physicists are less careful about some things (like interchanging
sums, integrals and limits in all the above) they tend to treat the object $\delta(x)$ just as a
function, and even named it that way. You will get used to these properties later on.
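
The sifting property can be watched at work for finite $n$. The following sketch (our own, with hypothetical function names) integrates a test function against the box function $\chi_n$, taken here with height $n$ on an interval of width $1/n$ as in (7.26), and shows the result approaching $f(0)$, or $f(a)$ for the shifted box.

```python
import math

def chi(n, x):
    """Box function chi_n: height n on (-1/(2n), 1/(2n)), zero elsewhere."""
    return float(n) if abs(x) < 1.0 / (2 * n) else 0.0

def integrate_against_chi(f, n, a=0.0, samples=2000):
    """Midpoint-rule value of the integral of f(x) * chi_n(x - a) dx.
    Only the support [a - 1/(2n), a + 1/(2n)] is sampled, since the
    integrand vanishes everywhere else."""
    width = 1.0 / n
    dx = width / samples
    left = a - width / 2
    total = 0.0
    for j in range(samples):
        x = left + (j + 0.5) * dx
        total += f(x) * chi(n, x - a)
    return total * dx

if __name__ == "__main__":
    for n in (1, 10, 100, 1000):
        print(n, integrate_against_chi(math.cos, n))          # tends to cos(0) = 1
    print(integrate_against_chi(math.exp, 1000, a=2.0), math.exp(2.0))
```

For $f = \cos$ the exact value at finite $n$ is $2n \sin(1/2n)$, so the deviation from $f(0) = 1$ shrinks like $1/n^2$.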
Small remark

A last remark: you may wonder whether the definition used here is really equivalent to
the one from the section on Fourier transforms. We will not really show this, but you may be
convinced by looking back at Figure 7.2. There too, we were dealing with an infinite sequence
of functions (labeled by $K$ instead of $n$). There too, a central peak was getting higher and
narrower, while keeping the area under the graph constant. Hopefully you see that
such a sequence of functions will converge to the same 'object', the Dirac delta function. Just
notice that all the properties derived above only depended on the limiting behavior
of the $\chi_n$, and not so much on their specific shape. So much for the tools; let us get back to the
world of quantum mechanics.



7.2 Changing habits

In the previous chapters, you learned that the states of a particle form a complex vector
space, the Hilbert space. So far, we have studied three systems: a particle in a box, a
particle in a harmonic potential, and the hydrogen atom. In each case we have found energy
eigenstates as a natural basis (modulo some complications that arose for the hydrogen
atom). Now let's try to change habits a bit, and see if we can also express states in terms
of another basis. Here we will describe two other natural bases: the momentum basis
and the position basis. Just to make things easier, we will work in one dimension (this
avoids working with vectors) but everything can be readily generalized to three dimensions.

7.2.1 Momentum basis 

We have met the momentum operator already several times:

$$P = -i\hbar\, \partial_x \qquad (7.30)$$

You can easily check that this operator has eigenfunctions

$$e^{ipx/\hbar} \qquad (7.31)$$

with eigenvalue $p$. Now imagine someone gives you the wave function $\psi(x)$ of a particle,
and asks you whether you can write that wave function as a sum of momentum eigenstates
$e^{ipx/\hbar}$. You think about it for a moment, and then you realize: of course I can! This is
precisely what a Fourier transform does. So yes, you can write $\psi$ as the decomposition

$$\psi(x) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} dp\, \psi(p)\, e^{ipx/\hbar} \qquad (7.32)$$

where $\psi(p)$ is given by

$$\psi(p) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} dx\, \psi(x)\, e^{-ipx/\hbar} \qquad (7.33)$$

Compared to the original expression for the Fourier transform, some extra $\hbar$'s have popped
up in the prefactors. This is just a convention, and due to the fact that we decompose into
$e^{ipx/\hbar}$ instead of the ordinary $e^{ikx}$. Furthermore, we have denoted the Fourier transform
$\psi(p)$ by the same letter $\psi$. In principle this is an ambiguous notation, but we will always
write the argument explicitly to avoid confusion. So $\psi(p)$ and $\psi(x)$ are different functions.
The Fourier transform $\psi(p)$ is called the momentum space representation of the wave
function. For each $p$, it tells you to what extent the wave function looks like $e^{ipx/\hbar}$. In
contrast, the wave function $\psi(x)$ which we worked with up to now, is called the position
representation. The momentum representation (like the position representation)
is a function spread out over a certain region. So just like the fact that a particle
doesn't have a sharp position, it has a smeared out (complex) function describing its
momentum. This means we have a very symmetric way to describe a particle: either by
its wave function in position space $\psi(x)$, or by its wave function in momentum space $\psi(p)$.
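
Both statements of this subsection can be tried out numerically. The sketch below is our own illustration under stated assumptions: units with $\hbar = 1$, a central finite difference for the derivative in (7.30), a midpoint rule for the integral (7.33), and a hypothetical Gaussian wave packet with mean momentum $p_0 = 2$ as test input.

```python
import cmath
import math

HBAR = 1.0   # assumption: units in which hbar = 1
P0 = 2.0     # assumed mean momentum of the example packet

def plane_wave(p, x):
    """The momentum eigenfunction exp(i p x / hbar) of (7.31)."""
    return cmath.exp(1j * p * x / HBAR)

def apply_momentum(psi, x, h=1e-5):
    """(-i hbar d/dx psi)(x), with the derivative taken by central difference."""
    return -1j * HBAR * (psi(x + h) - psi(x - h)) / (2 * h)

def momentum_wavefunction(psi, p, half_width=10.0, samples=4000):
    """psi(p) from (7.33), midpoint rule on [-half_width, half_width]."""
    dx = 2 * half_width / samples
    total = 0j
    for j in range(samples):
        x = -half_width + (j + 0.5) * dx
        total += psi(x) * cmath.exp(-1j * p * x / HBAR)
    return total * dx / math.sqrt(2 * math.pi * HBAR)

def packet(x):
    """Normalized Gaussian envelope times a plane wave of momentum P0."""
    return math.pi ** -0.25 * math.exp(-x * x / 2) * plane_wave(P0, x)

if __name__ == "__main__":
    wave = lambda x: plane_wave(1.5, x)
    print(apply_momentum(wave, 0.3) / wave(0.3))      # close to the eigenvalue 1.5
    for p in (0.0, 1.0, 2.0, 3.0, 4.0):
        print(p, abs(momentum_wavefunction(packet, p)))  # largest near p = P0
```

The second loop shows what 'looks like $e^{ipx/\hbar}$' means in practice: $|\psi(p)|$ is largest for momenta close to the $p_0$ that was put into the packet.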

Bra-ket notation 

We can also rewrite the above in bra-ket notation. First, we define the momentum state
$|p\rangle$ by taking the corresponding wave function as follows:

$$|p\rangle \;\leftrightarrow\; \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar} \qquad (7.34)$$

With this, we can write the state $|\psi\rangle$ corresponding to the wave function $\psi(x)$ as follows:

$$|\psi\rangle = \int_{-\infty}^{\infty} dp\, \psi(p)\, |p\rangle \qquad (7.35)$$

If you see the integral as an infinite sum, the right hand side is a linear combination of all
the different $|p\rangle$. The $\psi(p)$ can thus be seen as the coefficients of all the $|p\rangle$ in this linear
combination and is given explicitly by (7.33). With this, we have reached the goal of this
section: decomposing an arbitrary state $|\psi\rangle$ with respect to the basis of all momentum
eigenstates $|p\rangle$.

Measurements 

Besides giving the particle's momentum 'components', the function $\psi(p)$ has another meaning.
Given a state $|\psi\rangle$ and a device that measures the momentum, the chance to receive
an outcome between momenta $p_0$ and $p_1$ is given by

$$P(p_0 < \text{(outcome of measurement)} < p_1) = \int_{p_0}^{p_1} |\psi(p)|^2\, dp \qquad (7.36)$$

So the object $|\psi(p)|^2$ acts as a probability density, just like $|\psi(x)|^2$. Yes, yes: it is all
very symmetric between $x$ and $p$. Here is an example of a wave function $\psi(x)$ (real part
shown) and its momentum representation $\psi(p)$:

[Figure: a wave function $\psi(x)$ (real part) plotted against $x$, next to its momentum representation $\psi(p)$.]



7.2.2 Position basis 

We now know there is something like a momentum basis. Let us now try to hunt for
its counterpart, the position basis. First, think about this question more carefully. The
eigenstates we found for the momentum operator had a very clear physical meaning: they
were infinite waves, with a sharp and well-defined momentum. Similarly, we are now
looking for functions that can describe a wave function which is sharply localized, so that
it has an exact position. We did recently meet such an 'infinitely sharp' function: the delta
function. Indeed, the set of functions

$$f_{x_0}(x) = \delta(x - x_0) \qquad (7.37)$$

are all infinitely peaked. The peak of each $f_{x_0}$ is at $x_0$, as at that place the function value
is $\delta(x_0 - x_0) = \delta(0) = \infty$. So we suggest that the function $f_{x_0}$ describes the wave function
of a state with exact (sharp) position $x_0$. We denote such a sharp position-state by $|x_0\rangle$,
so

$$\text{wave function } f_{x_0} \;\leftrightarrow\; \text{state } |x_0\rangle \;\leftrightarrow\; \text{state sharply localized at position } x_0 \qquad (7.38)$$



Let us do a check on this. For consistency, we should show that the position-states $|x_0\rangle$ are
indeed eigenstates of the position operator $X$, just in the same way the momentum-states
$|p\rangle$ are eigenstates of the momentum operator $P$. So we want to show:

$$X|x_0\rangle = x_0|x_0\rangle \qquad (7.39)$$

for all $x_0$. Using that $X$ acts as multiplication with the identity function (so $X : f(x) \mapsto
x \cdot f(x)$) the wave functions of both sides are

$$x\, \delta(x - x_0) = x_0\, \delta(x - x_0). \qquad (7.40)$$

As we have said before, $\delta$ is a distribution, and such an object makes better sense when
standing inside an integral. Let us see what both sides of (7.40) give inside an integral
with an arbitrary function $h(x)$:

$$\int_{-\infty}^{\infty} h(x) \left( x\, \delta(x - x_0) \right) dx = \int_{-\infty}^{\infty} \left( x\, h(x) \right) \delta(x - x_0)\, dx = x_0\, h(x_0)$$

$$\int_{-\infty}^{\infty} h(x) \left( x_0\, \delta(x - x_0) \right) dx = x_0 \int_{-\infty}^{\infty} h(x)\, \delta(x - x_0)\, dx = x_0\, h(x_0)$$

The result is the same! As the above holds for any function $h(x)$, we have to conclude
that the objects $x\, \delta(x - x_0)$ and $x_0\, \delta(x - x_0)$ are indeed equal. This proves (7.40) and hence
(7.39).

Bra-ket notation 

Can we now express any state as a linear combination of position states $|x_0\rangle$? By analogy
with momentum eigenstates, you might guess that we can write an arbitrary state $|\psi\rangle$ as
follows:

$$|\psi\rangle = \int_{-\infty}^{\infty} dx_0\, \psi(x_0)\, |x_0\rangle \qquad (7.41)$$

where $\psi(x_0)$ is the wave function, evaluated at $x_0$. This expression makes sense since the
weight given to each position state $|x_0\rangle$ in the above 'sum' is just $\psi(x_0)$: precisely the
value of the wave function at position $x_0$. Better even, we can actually prove the above
equation by showing that the state on the right side really has wave function $\psi(x)$. To
see this, note that each $|x_0\rangle$ has wave function $\delta(x - x_0)$, so the total wave function of the
right hand side is

$$\int_{-\infty}^{\infty} dx_0\, \psi(x_0)\, \delta(x - x_0), \qquad (7.42)$$

which just equals $\psi(x)$ by the standard property of the delta function. Expression (7.41)
completes our second goal: writing out an arbitrary state $|\psi\rangle$ as a linear combination
(integral) of position-basis elements. This means we now understand how to write an
arbitrary state in several bases:

• a basis of energy eigenstates (as in the first chapters),

• a basis of momentum eigenstates $|p\rangle$ (each with weight $\psi(p)$),

• or a basis of position eigenstates $|x\rangle$ (each with weight $\psi(x)$).

The last two give the momentum and position representations we wanted to discover. Before
we do some concrete examples, one last remark.

Caricature states 

We just got to know two new kinds of states: momentum states $|p\rangle$ and position states
$|x\rangle$. (From now on, we drop the subscript on the position state, so by $|x\rangle$ we mean the
position state located at position $x$.) Both are a bit peculiar: one is infinite in extent
(an infinite wave), the other is infinitely sharp (a delta function). We already mentioned
that real particles are never in such a state, but always in a superposition (integral) of these,
which results in a nice, finite and smooth wave function. So what are the states $|p\rangle$ and $|x\rangle$
then? The best way to describe them is as caricature states. They describe a particle
as it would be if you could squeeze it into a perfect state with exact momentum, or into a perfect state of
exact position. A very good comparison is that with point masses in Newtonian mechanics. When
dealing with problems of bodies moving and interacting, you were often told to 'treat the
bodies involved as point masses'. This is not a realistic description, but allows for simple
calculations. A point mass does not exist, but the mathematics of such an object is much
easier than that of a (physical, extended) object. In very much the same way, the states
$|p\rangle$ and $|x\rangle$ do not 'exist in nature', but they have some nice properties, like providing a
clean basis in which you can express a physical state $|\psi\rangle$. That is why we study them,
nothing more, nothing less.

7.3 Using the position and momentum representation

7.3.1 Example 

We will now use all the above in the simplest example we have met so far: the particle in
a box. Recall that we found energy eigenstates labelled by $n = 1, 2, \ldots$:

$$\psi_n(x) = \sqrt{\frac{2}{L}} \sin\frac{n\pi x}{L} \qquad (7.43)$$

These satisfy $H\psi_n(x) = E_n \psi_n(x)$, and their time evolution dictated by the Schrödinger equation is given by

$$\psi_n(x,t) = e^{-iE_n t/\hbar}\, \psi_n(x) \qquad (7.44)$$

The most general element in the Hilbert space is given by

$$\psi(x) = \sum_n a_n \psi_n(x) \qquad (7.45)$$

and its time evolution is just given by $\psi(x,t) = \sum_n a_n \psi_n(x,t)$.

Position basis 

The last equation describes a general element $|\psi\rangle$ in the Hilbert space, decomposed in
energy eigenstates. Here, we will try to express this very state $|\psi\rangle$ in the position or
momentum basis. Let us start with the easy one: the position basis. First, note that
every energy eigenstate $|\psi_n\rangle$ can be written as

$$|\psi_n\rangle = \int_{-\infty}^{\infty} dx\, \psi_n(x)\, |x\rangle = \int_0^L dx\, \sqrt{\frac{2}{L}} \sin\frac{n\pi x}{L}\, |x\rangle \qquad (7.46)$$

The range of integration has changed in the last step: this is because the wave functions
$\psi_n$ are actually zero outside $[0, L]$. Hence, for the state $|\psi\rangle$ itself, we have:

$$|\psi\rangle = \sum_n a_n |\psi_n\rangle \qquad (7.47)$$

$$= \sum_n a_n \int_0^L dx\, \sqrt{\frac{2}{L}} \sin\frac{n\pi x}{L}\, |x\rangle \qquad (7.48)$$

$$= \int_0^L dx \left( \sqrt{\frac{2}{L}} \sum_n a_n \sin\frac{n\pi x}{L} \right) |x\rangle \qquad (7.49)$$

In the same way the $a_n$ are the coefficients of $|\psi\rangle$ with respect to the energy-eigenstate
basis, the function $\left( \sqrt{\frac{2}{L}} \sum_n a_n \sin\frac{n\pi x}{L} \right)$ describes the 'coefficients' of the same state when
expressed in the position basis. In simple terms: this object is just the (position) wave
function of $|\psi\rangle$. Can we also do the same for momentum? That is, can we also find a
function $\psi(p)$ such that

$$|\psi\rangle = \int_{-\infty}^{\infty} dp\, \psi(p)\, |p\rangle \qquad (7.50)$$



Again we first try to do this for each $|\psi_n\rangle$ individually, so we will first look for functions
$\psi_n(p)$ satisfying

$$|\psi_n\rangle = \int_{-\infty}^{\infty} dp\, \psi_n(p)\, |p\rangle \qquad (7.51)$$

From the general formula (7.33) for the Fourier transform, and writing the sine as
$\sin\theta = (e^{i\theta} - e^{-i\theta})/2i$, we get

$$\psi_n(p) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} dx\, e^{-ipx/\hbar}\, \psi_n(x) = \frac{1}{\sqrt{2\pi\hbar}} \int_0^L dx\, e^{-ipx/\hbar} \sqrt{\frac{2}{L}} \sin\frac{n\pi x}{L} \qquad (7.52)$$

$$= \frac{1}{\sqrt{4\pi\hbar L}} \left[ \frac{e^{-ic_2 x}}{c_2} - \frac{e^{-ic_1 x}}{c_1} \right]_{x=0}^{x=L} \qquad (7.53)$$

In the last step we have defined $c_1 = \left( \frac{p}{\hbar} + \frac{n\pi}{L} \right)$ and $c_2 = \left( \frac{p}{\hbar} - \frac{n\pi}{L} \right)$. With these expressions,
you can write out the end result and simplify it a little bit, but we will not make this
effort here. Just note that the end result depends on $p$ via the objects $c_1$ and $c_2$. We
have just done our first Fourier transform! Congratulations. With this, the state $|\psi\rangle$ can
then be written as follows:

$$|\psi\rangle = \sum_n a_n |\psi_n\rangle = \sum_n a_n \left( \int_{-\infty}^{\infty} dp\, \psi_n(p)\, |p\rangle \right) = \int_{-\infty}^{\infty} dp \left( \sum_n a_n \psi_n(p) \right) |p\rangle \qquad (7.54)$$

where the object between brackets (with $\psi_n(p)$ given in (7.53)) is just one function of $p$.
This is the finish line! We have succeeded in writing $|\psi\rangle$ not only in the energy representation
(7.45) but also in the position basis (7.49) and momentum basis (7.54). If you managed
to follow this last derivation, you have probably understood most of this chapter so far.
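
As a cross-check of this example, the sketch below (our own, not from the course) compares a direct numerical evaluation of the Fourier integral (7.33) for the box states with a closed form obtained by writing the sine as a difference of two exponentials. The abbreviations $c_1 = p/\hbar + n\pi/L$ and $c_2 = p/\hbar - n\pi/L$ are our reading of the definitions in the text, and the choices $\hbar = 1$ and $L = 1$ are assumptions made only to get numbers.

```python
import cmath
import math

HBAR = 1.0   # assumption: units in which hbar = 1
L = 1.0      # assumed box length

def psi_n(n, x):
    """Energy eigenfunction (7.43) of the particle in a box."""
    return math.sqrt(2 / L) * math.sin(n * math.pi * x / L) if 0 <= x <= L else 0.0

def psi_n_p_numeric(n, p, samples=4000):
    """Momentum wave function from (7.33), midpoint rule on [0, L]."""
    dx = L / samples
    total = 0j
    for j in range(samples):
        x = (j + 0.5) * dx
        total += cmath.exp(-1j * p * x / HBAR) * psi_n(n, x)
    return total * dx / math.sqrt(2 * math.pi * HBAR)

def psi_n_p_closed(n, p):
    """Closed form of the same integral; not valid at p = +/- n*pi*hbar/L,
    where c1 or c2 vanishes."""
    c1 = p / HBAR + n * math.pi / L
    c2 = p / HBAR - n * math.pi / L

    def bracket(x):
        return cmath.exp(-1j * c2 * x) / c2 - cmath.exp(-1j * c1 * x) / c1

    return (bracket(L) - bracket(0.0)) / math.sqrt(4 * math.pi * HBAR * L)

if __name__ == "__main__":
    for n, p in ((1, -0.4), (2, 1.3), (3, 5.0)):
        print(n, p, psi_n_p_numeric(n, p), psi_n_p_closed(n, p))
```

The two columns agree to many digits, which suggests that the bookkeeping of signs and prefactors is consistent.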

7.3.2 Projection, inner product and completeness 

As a last application of what we have learned in this chapter, we give the so-called completeness
relations. These are very useful equations, and provide a nice mnemonic for
the formula of the Fourier transform. First, recall that in a vector space we could take the
projection of a vector on a basis element:

$$(e_i, v) = v_i \qquad (7.55)$$

If the basis vectors $e_i$ are orthonormal, we can get back the original vector $v$ by summing
up all its projections along the basis vectors:

$$v = \sum_i e_i v_i = \sum_i e_i\, (e_i, v) \qquad (7.56)$$

Or if we write the right hand side with a Hermitian conjugate:

$$v = \sum_i e_i \left( e_i^\dagger v \right) = \left( \sum_i e_i e_i^\dagger \right) v \qquad (7.57)$$

In the last step, we used associativity of matrix multiplication. This is true for all $v$, which
implies

$$\sum_i e_i e_i^\dagger = 1 \qquad (7.58)$$



where 1 stands for the identity matrix. For example, in a two dimensional vector space
with orthonormal basis $e_1 = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and $e_2 = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ we have

$$\sum_i e_i e_i^\dagger = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \begin{pmatrix} 1 & 0 \end{pmatrix} + \begin{pmatrix} 0 \\ 1 \end{pmatrix} \begin{pmatrix} 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}. \qquad (7.59)$$

(In the context of ordinary linear algebra, the Hermitian adjoint is denoted by an asterisk * instead of
a $\dagger$.)

Equation (7.58) is the completeness relation (for vector spaces). The name is derived from
the fact that such a relation only holds for a complete (and orthonormal) basis. Again,
it is possible to extend this to the context of quantum mechanics. First, we give some
results that you can show yourself (see exercises):

$$\langle x|\psi\rangle = \psi(x) \qquad (7.60)$$

$$\langle p|\psi\rangle = \psi(p) \qquad (7.61)$$

$$\langle x|p\rangle = \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar} \qquad (7.62)$$

$$\langle x_1|x_2\rangle = \delta(x_1 - x_2) \qquad (7.63)$$

$$\langle p_1|p_2\rangle = \delta(p_1 - p_2) \qquad (7.64)$$

Especially the last two equations are very suggestive: the states $|x\rangle$ and $|p\rangle$ are 'orthonormal',
only with a Dirac delta function instead of a Kronecker delta. This suggests that
maybe here too, we have something like a completeness relation. Indeed, it turns out that

$$\int_{-\infty}^{\infty} dx\, |x\rangle\langle x| = 1 \qquad (7.65)$$

$$\int_{-\infty}^{\infty} dp\, |p\rangle\langle p| = 1 \qquad (7.66)$$

where 1 is the identity operator (it sends every state to itself). These are the completeness
relations for quantum mechanics. You might be a bit puzzled whether the objects on the left side
are really operators. The object $\int_{-\infty}^{\infty} dx\, |x\rangle\langle x|$ for example is to be read as the command:
starting with a state, take the inner product with $\langle x|$. Then, multiply this (complex)
number by the corresponding state $|x\rangle$ and then sum over all $x$. This operation indeed
sends a state to a state. Hopefully, you see the full parallel with (7.58). Here are nice
consequences of the above equations. First:

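$$|\psi\rangle = 1\, |\psi\rangle \qquad (7.67)$$

$$= \left( \int_{-\infty}^{\infty} dx\, |x\rangle\langle x| \right) |\psi\rangle \qquad (7.68)$$

$$= \int_{-\infty}^{\infty} dx\, |x\rangle \langle x|\psi\rangle \qquad (7.69)$$

$$= \int_{-\infty}^{\infty} dx\, |x\rangle\, \psi(x) \qquad (7.70)$$

$$= \int_{-\infty}^{\infty} dx\, \psi(x)\, |x\rangle. \qquad (7.71)$$

Ah! If you compare the first and last part, you see that we precisely get the decomposition
of $|\psi\rangle$ into position states. That's a really short way to get there. Another example:

$$\psi(x) = \langle x|\psi\rangle \qquad (7.72)$$

$$= \langle x|\, 1\, |\psi\rangle \qquad (7.73)$$

$$= \langle x| \left( \int_{-\infty}^{\infty} dp\, |p\rangle\langle p| \right) |\psi\rangle \qquad (7.74)$$

$$= \int_{-\infty}^{\infty} dp\, \langle x|p\rangle \langle p|\psi\rangle \qquad (7.75)$$

$$= \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{\infty} dp\, e^{ipx/\hbar}\, \psi(p) \qquad (7.76)$$

In the second-to-last step, when pulling out the integral to the left, we have used linearity of
the inner product. Once again comparing the first and last part, we see that we have fully
rediscovered the Fourier transform, out of the hat! This ultrashort 'proof' of the Fourier
transform formula is a very clear sign of the power of the bra-ket notation. You may even
start to like it one day...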
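
The finite-dimensional completeness relation, the sum of all $e_i e_i^\dagger$ being the identity, is easy to try out on a small example. The sketch below is our own illustration; it uses a hypothetical orthonormal basis of a two-dimensional complex vector space that is deliberately different from the standard one, builds the sum of outer products, and rebuilds a vector from its projections.

```python
import math

def inner(u, w):
    """Inner product (u, w) = u^dagger w of two complex column vectors."""
    return sum(complex(ui).conjugate() * wi for ui, wi in zip(u, w))

def outer(u, w):
    """Matrix u w^dagger as a list of rows."""
    return [[ui * complex(wj).conjugate() for wj in w] for ui in u]

def completeness(basis):
    """Sum over the basis of e_i e_i^dagger."""
    dim = len(basis[0])
    total = [[0j] * dim for _ in range(dim)]
    for e in basis:
        projector = outer(e, e)
        for r in range(dim):
            for c in range(dim):
                total[r][c] += projector[r][c]
    return total

def expand(v, basis):
    """Rebuild v as the sum of e_i (e_i, v)."""
    result = [0j] * len(v)
    for e in basis:
        coefficient = inner(e, v)
        for r in range(len(v)):
            result[r] += coefficient * e[r]
    return result

S = 1 / math.sqrt(2)
BASIS = [[S, 1j * S], [S, -1j * S]]   # assumed example basis (orthonormal)

if __name__ == "__main__":
    print(completeness(BASIS))          # numerically the 2x2 identity matrix
    print(expand([2 + 1j, -3j], BASIS)) # numerically the original vector
```

If one basis vector is left out, the sum of outer products is no longer the identity and the vector is not recovered, which is exactly what 'complete' refers to.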

Conclusion 

In this chapter, you have learned to view a wave packet in a different way: as a superposition
of different infinite waves (momentum eigenstates), each with some weight. This
should caution you a bit when speaking about a particle's momentum: just like with
position, the particle's state comprises an extended region of momenta. Of course, you can
still speak of the expectation value of both; these are well-defined.

When you think back to the de Broglie relations, you may be a bit confused: there, the
particle's momentum, energy and wavelength occur. Actually, it is best to see the variables
over there as expectation values, for a state which is sufficiently centered around these
values.

Exercises

1. Show (7.60) - (7.64).

2. Argue that the probability densities of the momentum and position representations
are actually mere applications of the Postulate on the measurement, with the only
difference that sums are replaced by integrals, and probabilities by probability densities.

3. Check that (7.3) is true (should not be much work). Can you find an easy way
to show (7.4) for the cases $n \neq m$? (Hint: the states $\psi_n$ of the particle in a box
have different eigenvalues under $H$. What does this imply? Can you relate this
to what we want to show here?) To show the case $n = m$, you may want to use
$\sin^2 mx = \frac{1 - \cos 2mx}{2}$. Use this result to check that we properly normalized the states
of the particle in the box by putting the prefactor $\sqrt{2/L}$.

4. Consider the function $f(x)$, which has period $2\pi$, has $f(x) = -1$ if $-\pi < x < 0$ and
$f(x) = 1$ if $0 < x < \pi$. This function is called the square wave. What does this
function look like: a sine or a cosine function? So which component of the series do
you expect to be largest? Now go ahead and compute its Fourier series. (You best
split the integrals in two parts. Which Fourier components are zero?)

5. Consider the wave function $\psi_n(x) = \sqrt{\chi_n(x)}$, which equals $\sqrt{n}$ on $-\frac{1}{2n} < x < \frac{1}{2n}$ and
zero elsewhere. What is the Fourier transform of $\psi_n(x)$? Which function do you
recognize?



Chapter 8 

The uncertainty principle 



In this chapter... 

In the chapter on commutators we showed that compatible observables (observables admitting
a basis, the elements of which are eigenstates of both operators) always
commute. In this chapter, we spend some time on a logical consequence of this: if operators
don't commute, it is impossible to find such a basis of common eigenstates. A
quantitative consequence is the so-called uncertainty principle. This is an important principle,
with a very high degree of quantum flavor to it: it will help you to get a better feel
for the nature of wave functions.

In the tool part, we show an important example of a Fourier transform which already
suggests the uncertainty principle. In the story, we explain in more detail what
the principle is and where it comes from. We then give a proof and conclude with several
consequences/applications.

8.1 Fourier transform of Gaussians

As promised in the intro, the tool part consists of an extra example on Fourier transforms. 
Two other small things we will review are the notion of variance, and the Cauchy-Schwarz 
inequality. Ready? Go! 

8.1.1 Variance 

In statistics a function $p(x)$ is called a probability density function if for each $x$, $p(x)$
is a measure for the chance to have $x$ as an outcome. The usage of this term in quantum
mechanics is just a special case of this. For the total probability to be one, we need:

$$\int_{-\infty}^{\infty} p(x)\, dx = 1 \qquad (8.1)$$

One then defines the expectation value of $x$ as follows:

$$\langle x \rangle = \int_{-\infty}^{\infty} x\, p(x)\, dx. \qquad (8.2)$$

For example, if $p(x)$ is the chance to win an amount $x$ at the lottery, then $\langle x \rangle$ will give
the amount you win on average per game. (Which is always lower than the cost of your
ticket, alas.) Equally interesting, one may ask how much your winnings will typically be
away from the average winnings. A quantity expressing this is given by

$$\int_{-\infty}^{\infty} (x - \langle x \rangle)^2\, p(x)\, dx. \qquad (8.3)$$

This quantity indeed says something about the typical deviation: it gives the expectation
of the squared deviation from the average. This quantity is the variance $\mathrm{Var}(x)$, and its
square root is called the uncertainty $\Delta x$:

$$\langle (x - \langle x \rangle)^2 \rangle = \mathrm{Var}(x) = (\Delta x)^2 \qquad (8.4)$$

Of course you see that the above is what we have been using in the context of quantum
mechanics. Just write the probability density $|\psi(x)|^2$ instead of the probability density
function $p$. The quantity $\int_{-\infty}^{\infty} |\psi|^2 x\, dx$ then nicely gives the expectation value of the
position:

$$\langle X \rangle = \langle\psi|X|\psi\rangle = \int_{-\infty}^{\infty} \psi^*(x)\, X\, \psi(x)\, dx = \int_{-\infty}^{\infty} x\, |\psi(x)|^2\, dx. \qquad (8.5)$$

Similarly the uncertainty of the position is given by

$$\Delta X = \sqrt{\langle (X - \langle X \rangle)^2 \rangle} \qquad (8.6)$$

with

$$\langle (X - \langle X \rangle)^2 \rangle = \int_{-\infty}^{\infty} (x - \langle X \rangle)^2\, |\psi(x)|^2\, dx. \qquad (8.7)$$

More generally, for any observable $O$, we can define the expectation value $\langle O \rangle = \langle\psi|O|\psi\rangle$
(see chapter on observables), and the corresponding uncertainty is

$$\Delta O = \sqrt{\langle (O - \langle O \rangle)^2 \rangle} \qquad (8.8)$$

Although the name sounds somewhat mystical, there is nothing deep about the notion of
uncertainty: it just expresses how far possible measurement outcomes of the observable $O$
on a given state $|\psi\rangle$ are lying apart. For example: a wave packet that is rather localized
will have a small uncertainty $\Delta X$, a packet which is more spread out will have a much
larger position uncertainty. Last remark: although the last two expressions don't seem to
contain the state $|\psi\rangle$, they really do depend on it as the expectation values $\langle\,\cdot\,\rangle$ are to be
taken between $\langle\psi|$ and $|\psi\rangle$. To avoid confusion, some people write subscripts:

$$(\Delta O)_\psi = \sqrt{\left\langle \left( O - \langle O \rangle_\psi \right)^2 \right\rangle_\psi} \qquad (8.9)$$
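
The definitions of expectation value and uncertainty translate directly into a few lines of code. The sketch below is our own illustration: it takes as probability density $|\psi_1(x)|^2$ for the ground state of the particle in a box (with the assumed box length $L = 1$) and evaluates the integrals with a midpoint rule. The function names are hypothetical.

```python
import math

L = 1.0  # assumed box length

def expectation(g, density, a, b, samples=2000):
    """Integral of g(x) * density(x) over [a, b], midpoint rule."""
    dx = (b - a) / samples
    total = 0.0
    for j in range(samples):
        x = a + (j + 0.5) * dx
        total += g(x) * density(x)
    return total * dx

def mean_and_uncertainty(density, a, b):
    """Return (<x>, Delta x), with Delta x the square root of the variance."""
    mean = expectation(lambda x: x, density, a, b)
    variance = expectation(lambda x: (x - mean) ** 2, density, a, b)
    return mean, math.sqrt(variance)

def box_ground_state_density(x):
    """|psi_1(x)|^2 = (2/L) sin^2(pi x / L) on [0, L]."""
    return (2 / L) * math.sin(math.pi * x / L) ** 2

if __name__ == "__main__":
    print(expectation(lambda x: 1.0, box_ground_state_density, 0.0, L))  # total probability
    print(mean_and_uncertainty(box_ground_state_density, 0.0, L))
```

For this density the integrals can also be done by hand, giving $\langle X \rangle = L/2$ and $(\Delta X)^2 = L^2 \left( \frac{1}{12} - \frac{1}{2\pi^2} \right)$, so the position uncertainty is roughly $0.18\,L$.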



8.1.2 Fourier transform of Gaussians 



A very important function in math and physics is the Gaussian distribution. It is given
by

$$f(x) = \frac{1}{\sqrt{2\pi\sigma^2}}\,e^{-\frac{x^2}{2\sigma^2}} \tag{8.10}$$

and has variance $\mathrm{Var}(x) = \sigma^2$. Also, using the property

$$\int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi} \tag{8.11}$$

it is easy to show that $f$ is a properly normalized probability density function.
Figure 8.1: The Gaussian distribution, for different values of the variance

The fact that $f$ is normalized means that its square root

$$\phi(x) = \sqrt{f(x)} = \frac{1}{(2\pi\sigma^2)^{1/4}}\,e^{-\frac{x^2}{4\sigma^2}} \tag{8.12}$$

is a normalized wave function: $\int_{-\infty}^{\infty} |\phi(x)|^2\,dx = 1$. Now we ask the question: what is the
Fourier transform of this function? In other words: what is the momentum representation
$\tilde\phi(p)$ of the state $|\phi\rangle$? We know that

$$\tilde\phi(p) = \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{\infty} dx\,\phi(x)\,e^{-ipx/\hbar} \tag{8.13}$$

so we get

$$\tilde\phi(p) = \frac{1}{\sqrt{2\pi\hbar}}\,\frac{1}{(2\pi\sigma^2)^{1/4}}\int_{-\infty}^{\infty} dx\,e^{-\frac{x^2}{4\sigma^2}}\,e^{-ipx/\hbar} \tag{8.14}$$

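At first sight, performing the integral is not so obvious. However, there is a smart way
to recast it into a form that allows us to use (8.11). This is done by completing the square
in the exponent, as follows. Go to a new variable $x' = x + 2i\sigma^2 p/\hbar$. Then

$$x'^2 = x^2 + \frac{4i\sigma^2 p}{\hbar}\,x - \frac{4\sigma^4 p^2}{\hbar^2} \tag{8.15}$$

and

$$-\frac{1}{4\sigma^2}\,x'^2 = -\frac{1}{4\sigma^2}\,x^2 - \frac{i}{\hbar}\,px + \frac{\sigma^2 p^2}{\hbar^2}. \tag{8.16}$$

Exponentiating, we get

$$e^{-\frac{x'^2}{4\sigma^2}} = e^{-\frac{x^2}{4\sigma^2}}\,e^{-ipx/\hbar}\,e^{\sigma^2 p^2/\hbar^2} \tag{8.17}$$

which is the integrand of (8.14) up to the last ($x$-independent) factor. This means that in
terms of $x'$ and $dx'$ ($= dx$),

$$\tilde\phi(p) = \frac{1}{\sqrt{2\pi\hbar}\,(2\pi\sigma^2)^{1/4}}\int_{-\infty}^{\infty} dx'\,e^{-\frac{x'^2}{4\sigma^2}}\,e^{-\sigma^2 p^2/\hbar^2} \tag{8.18}$$

$$= \frac{1}{\sqrt{2\pi\hbar}\,(2\pi\sigma^2)^{1/4}}\,e^{-\sigma^2 p^2/\hbar^2}\int_{-\infty}^{\infty} dx'\,e^{-\frac{x'^2}{4\sigma^2}} \tag{8.19}$$

$$= \frac{1}{\sqrt{2\pi\hbar}\,(2\pi\sigma^2)^{1/4}}\,e^{-\sigma^2 p^2/\hbar^2}\cdot 2\sigma\sqrt{\pi} \tag{8.20}$$

where in the last step we have used (8.11), after rescaling the integration variable by $2\sigma$.
Rearranging, and defining $\sigma' = \frac{\hbar}{2\sigma}$, we finally get

$$\tilde\phi(p) = \frac{1}{(2\pi\sigma'^2)^{1/4}}\,e^{-\frac{p^2}{4\sigma'^2}} \tag{8.21}$$

Comparing to (8.12), you see that this distribution is again Gaussian, but now with
variance $\sigma'^2$ instead of $\sigma^2$. This result is shown in the next figure: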
This is quite remarkable: if a Gaussian wave packet is sharply peaked, its momentum
representation will be very spread out, and vice versa. This is already a first glimpse of 
the uncertainty principle which we are about to meet. Before we go there, another small 
recap which might be useful: 

8.1.3 Cauchy-Schwarz inequality 

You probably remember the famous Cauchy-Schwarz inequality from a course on linear
algebra. It states that for two vectors $x$ and $y$,

$$|x \cdot y| \leq \|x\|\,\|y\| \tag{8.22}$$

In words: the magnitude of the inner product of two vectors is never larger than the
product of their lengths. The reason for this inequality is very simple. For vectors, the
inner product is just given by $x \cdot y = \|x\|\,\|y\|\cos\theta$ with $\theta$ the angle between the two
vectors. Taking absolute values on both sides, you then get

$$|x \cdot y| = \|x\|\,\|y\|\,|\cos\theta| \leq \|x\|\,\|y\| \tag{8.23}$$

This inequality can also be expressed in terms of inner products only. Writing $\langle x, y\rangle$
instead of $x \cdot y$ and using $\|x\| = \sqrt{\langle x, x\rangle}$, we get

$$|\langle x, y\rangle|^2 \leq \langle x, x\rangle\,\langle y, y\rangle. \tag{8.24}$$

It turns out that this inequality is not only true for ordinary vector spaces $\mathbb{R}^n$, but for any
space with an inner product. This means that also in the context of the Hilbert space

$$|\langle\chi|\psi\rangle|^2 \leq \langle\chi|\chi\rangle\,\langle\psi|\psi\rangle \tag{8.25}$$

for all states $|\chi\rangle$ and $|\psi\rangle$. We will use this form of the Cauchy-Schwarz inequality later on
in this chapter. Time for the story part...
8.2 The uncertainty principle

8.2.1 Commuting observables, revisited 

In the chapter on the commutator, we learned that if two observables are compatible
(= there exists a basis of states $|\alpha_i, \beta_j\rangle$, labeled by $i$ and $j$, that are simultaneous eigenstates
of both $A$ and $B$) then these operators commute, $[A, B] = 0$:

$$A|\alpha_i, \beta_j\rangle = \alpha_i|\alpha_i, \beta_j\rangle \quad\text{and}\quad B|\alpha_i, \beta_j\rangle = \beta_j|\alpha_i, \beta_j\rangle \quad\Rightarrow\quad [A, B] = 0 \tag{8.26}$$

So in the case of commuting observables, a particle can have a definite value for both of
them at the same time: this is indeed true for all the states $|\alpha_i, \beta_j\rangle$. (Of course, it does
not have to be this way: if the particle is in a superposition, say a state proportional to
$|\alpha_i, \beta_j\rangle + |\alpha_{i'}, \beta_{j'}\rangle$, then it isn't an eigenstate of either of the two observables. But the fact
that you can find an entire basis of states which are simultaneous eigenstates is quite special.)
Now the question we want to address in this chapter is the following: what happens
if $[A, B] \neq 0$? Well, by the contrapositive of the above statement, if two operators do not
commute, they are incompatible and you can not find a basis of simultaneous eigenstates.
So simultaneous eigenstates will be rather rare in this case. In practice, there are often
no such states for non-commuting observables. So in such a situation we expect that an
eigenstate $|\alpha\rangle$ of $A$ can (generally) only be a sum of different eigenstates of $B$:

$$|\alpha\rangle = \sum_i |\beta_i\rangle \quad\text{with}\quad B|\beta_i\rangle = \beta_i|\beta_i\rangle \tag{8.27}$$

So for incompatible observables, if a state has a definite value with respect to one of them,
it usually does not have a definite value with respect to the other one.

To make this more concrete, let us think of two observables that do not commute. Ah!
Position and momentum are such a pair. Recall that

$$[X, P] = i\hbar \tag{8.28}$$

From the above, we expect that eigenstates of $X$ will not be eigenstates of $P$. After the
previous chapter, we know both sets of eigenstates very well. The first are the position
eigenstates $|x\rangle$ (delta functions) and the second are the momentum eigenstates $|p\rangle$ (infinite
waves). For sure, these are two very different sets of functions: the two sets have no
member in common. There is not a single state that is both a position eigenstate and a
momentum eigenstate.

8.2.2 The Gaussian distribution 

So the above suggests that for every state, at least one of the observables $X$ and $P$ has
to be 'spread out'. They can not both have a definite value. Let us now look back at the
computation we did in the first section. We found that for a Gaussian wave function

$$\phi(x) = \frac{1}{(2\pi\sigma^2)^{1/4}}\,e^{-\frac{x^2}{4\sigma^2}}, \tag{8.29}$$

the momentum representation is also Gaussian

$$\tilde\phi(p) = \frac{1}{(2\pi\sigma'^2)^{1/4}}\,e^{-\frac{p^2}{4\sigma'^2}} \tag{8.30}$$

and the variances $\sigma^2$ and $\sigma'^2$ are related by

$$\sigma'^2 = \frac{\hbar^2}{4\sigma^2}. \tag{8.31}$$

Remember that the square root of the variance is the uncertainty, so $\Delta X = \sigma$ and $\Delta P = \sigma'$.
In terms of these, the above relation becomes

$$\Delta X\,\Delta P = \frac{\hbar}{2} \tag{8.32}$$
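This relation can be checked numerically. The sketch below evaluates the Fourier integral
(8.13) for a Gaussian wave function by a plain Riemann sum and then computes both
uncertainties from the resulting probability densities. The choice of units ($\hbar = 1$), the
width `sigma`, the grids and the helper name `spread` are all assumptions made for this
illustration.

```python
import numpy as np

hbar = 1.0    # natural units (assumption for this illustration)
sigma = 0.7   # arbitrary position-space width

# Position grid and the Gaussian wave function phi(x)
x = np.linspace(-12.0, 12.0, 1201)
dx = x[1] - x[0]
phi_x = (2 * np.pi * sigma**2) ** -0.25 * np.exp(-x**2 / (4 * sigma**2))

# Momentum grid; Fourier integral approximated by a Riemann sum over x
p = np.linspace(-8.0, 8.0, 801)
dp = p[1] - p[0]
kernel = np.exp(-1j * np.outer(p, x) / hbar)
phi_p = (kernel @ phi_x) * dx / np.sqrt(2 * np.pi * hbar)

def spread(grid, amplitude, step):
    """Uncertainty (standard deviation) of |amplitude|^2 on a uniform grid."""
    prob = np.abs(amplitude) ** 2
    prob = prob / (prob.sum() * step)
    mean = (grid * prob).sum() * step
    return np.sqrt(((grid - mean) ** 2 * prob).sum() * step)

delta_x = spread(x, phi_x, dx)
delta_p = spread(p, phi_p, dp)
print(delta_x, delta_p, delta_x * delta_p)
```

Changing `sigma` makes one of the two uncertainties smaller and the other larger, while
their product stays at $\hbar/2$ (up to numerical error).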

8.2.3 The uncertainty principle of Heisenberg 

The relation (8.32) is a very striking result. It implies that a narrow distribution in
position space necessarily has a broad distribution in momentum space. It turns out that
this phenomenon also occurs for many other cases, not only for Gaussian wave packets.
Every time, the product $\Delta X\,\Delta P$ is of the order of $\hbar$. Actually, the Gaussian wave packets
have a bit of a special role: they realize the lowest value for $\Delta X\,\Delta P$. For other wave
functions, the product of the uncertainties tends to be bigger. This behavior leads to
the following inequality:

$$\Delta X\,\Delta P \geq \frac{\hbar}{2} \tag{8.33}$$

for every state $|\psi\rangle$. This relation is called the uncertainty principle and is due to the
physicist Werner Heisenberg. It is the central result of this chapter. We will actually prove
it in the next part. Before we go there, some small remarks:

• Note that $\Delta X$ and $\Delta P$ are quantities that depend on the specific state of the particle.
This is because the expectation values $\langle\cdot\rangle$ are always to be taken with respect to some
state. The uncertainty principle states that there is a universal relation between
those two quantities which is satisfied for every physical state.

• This relation has an immediate consequence for measurements. Remember that
measuring a quantity collapses a state into a narrow region around the measured value.
For example, measuring the position collapses a state into a very narrow region centered
around the value you read off from the apparatus. Ideally, this is a delta function (exact
position eigenstate) but in real (imperfect) measuring devices this is just a 'very
very narrow function'. For such a wave function, the uncertainty $\Delta X$ will be small.
The uncertainty principle then implies that $\Delta P$ is very large. This means that a
subsequent measurement of momentum can yield a very broad range of results, so
it is very hard to predict the outcome of this second measurement. Of course,
a similar thing occurs when you first measure the momentum and directly after that
the position.

• The last lines explained that there are consequences of the uncertainty principle
for the process of measurement. However, it is important to bear in mind that
the inequality also has a meaning outside the context of measurement. It is not
just a limitation on a researcher's ability to measure particular quantities of a system;
it is a statement about the nature of the wave function. Actually, it can be seen
as a mathematical statement, relating the variances of a function and its Fourier
transform. That's why we can prove it.





Figure 8.2: The uncertainty principle, $\Delta x\,\Delta p \geq \hbar/2$: a relation between the variances of the
position and momentum representation of the wave function.



8.3 Proof and applications

8.3.1 Proof 

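Consider two operators $A$ and $B$, and a state $|\psi\rangle$. Then

$$\|A|\psi\rangle\|^2\,\|B|\psi\rangle\|^2 = \langle\psi|A^\dagger A|\psi\rangle\,\langle\psi|B^\dagger B|\psi\rangle \geq |\langle\psi|A^\dagger B|\psi\rangle|^2 \tag{8.34}$$

In the first step we used the definition of the norm, and the fact that $(A|\psi\rangle)^\dagger = \langle\psi|A^\dagger$
and similarly $(B|\psi\rangle)^\dagger = \langle\psi|B^\dagger$. The last step is nothing but the good old Cauchy-Schwarz
inequality, reviewed earlier in this chapter, applied to the states $A|\psi\rangle$ and $B|\psi\rangle$. First,
look at the left hand side of (8.34). If we restrict $A$ and $B$ to be observables, they have to
be hermitean, so $A^\dagger = A$ and $B^\dagger = B$. This means the left hand side is equal to

$$\|A|\psi\rangle\|^2\,\|B|\psi\rangle\|^2 = \langle\psi|AA|\psi\rangle\,\langle\psi|BB|\psi\rangle = \langle A^2\rangle_\psi\,\langle B^2\rangle_\psi. \tag{8.35}$$

Now the right hand side of (8.34). This is the norm squared of the complex number
$\langle\psi|AB|\psi\rangle$. The norm squared of a complex number $z$ is always at least as large as the
square of its imaginary part $\mathrm{Im}\,z = (z - z^*)/2i$, and for hermitean $A$ and $B$ the complex
conjugate of $\langle\psi|AB|\psi\rangle$ is $\langle\psi|BA|\psi\rangle$. So we have

$$|\langle\psi|AB|\psi\rangle|^2 \geq \left|\frac{1}{2i}\langle\psi|AB - BA|\psi\rangle\right|^2 \tag{8.36}$$

If we now plug the above two equations into (8.34), we get:

$$\langle A^2\rangle_\psi\,\langle B^2\rangle_\psi \geq \frac{1}{4}\,|\langle[A, B]\rangle_\psi|^2 \tag{8.37}$$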

which is already very close to the general uncertainty relation. For the last step, note that
the above equation is true for all observables $A$, $B$, and all states $|\psi\rangle$. In particular, it will
also be true for the operators $\tilde A = A - \langle A\rangle_\psi$ and $\tilde B = B - \langle B\rangle_\psi$. Since $\langle\tilde A^2\rangle = (\Delta A)^2$,
$\langle\tilde B^2\rangle = (\Delta B)^2$ and $[\tilde A, \tilde B] = [A, B]$, application of the above equation to $\tilde A$ and $\tilde B$ gives

$$(\Delta A)^2_\psi\,(\Delta B)^2_\psi \geq \frac{1}{4}\,|\langle[A, B]\rangle_\psi|^2 \tag{8.38}$$

or, upon taking square roots, and dropping subscripts

$$\Delta A\,\Delta B \geq \frac{1}{2}\,|\langle[A, B]\rangle| \tag{8.39}$$

This is the general form of the uncertainty relation. It holds for any pair of observables $A$
and $B$. For the operators $X$ and $P$ in particular, you can check that it yields the result

$$\Delta X\,\Delta P \geq \frac{\hbar}{2} \tag{8.40}$$

Also, note that (as we suggested above) there may be some simultaneous eigenstates
of noncommuting observables. Indeed, this is possible if this particular state is
an eigenstate with eigenvalue 0 of the operator $[A, B]$. In that case, the uncertainty principle
imposes no conditions on the uncertainties of $A$ and $B$ for that state.
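The general relation (8.39) is easy to test numerically on a small system. The sketch below
does this for a spin-1/2 particle, with the spin operators written as $2 \times 2$ matrices
($\hbar/2$ times the Pauli matrices) and randomly drawn states. The choice of system, the
units $\hbar = 1$ and the helper names `expect` and `uncertainty` are assumptions made for this
illustration only.

```python
import numpy as np

hbar = 1.0  # natural units (assumption)

# Spin-1/2 operators: hbar/2 times the Pauli matrices sigma_x and sigma_y
Sx = 0.5 * hbar * np.array([[0, 1], [1, 0]], dtype=complex)
Sy = 0.5 * hbar * np.array([[0, -1j], [1j, 0]], dtype=complex)
commutator = Sx @ Sy - Sy @ Sx

def expect(op, psi):
    """Expectation value <psi|op|psi> for a normalized state vector psi."""
    return np.vdot(psi, op @ psi)

def uncertainty(op, psi):
    """Delta O = sqrt(<O^2> - <O>^2) for a hermitean operator."""
    mean = expect(op, psi).real
    return np.sqrt(max(expect(op @ op, psi).real - mean**2, 0.0))

rng = np.random.default_rng(0)
worst_margin = np.inf
for _ in range(1000):
    psi = rng.normal(size=2) + 1j * rng.normal(size=2)  # random state
    psi /= np.linalg.norm(psi)
    lhs = uncertainty(Sx, psi) * uncertainty(Sy, psi)
    rhs = 0.5 * abs(expect(commutator, psi))
    worst_margin = min(worst_margin, lhs - rhs)

# Smallest value of (Delta A Delta B - |<[A,B]>|/2) found over all sampled states
print(worst_margin)
```

The printed margin should never be negative (beyond rounding errors), whichever state is
drawn.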

8.3.2 Applications 

Application: uncertainty principle for other observables 

Another meaningful uncertainty relation can be stated in three dimensions for any pair of
the angular momentum operators. For example the uncertainties of $J_x$ and $J_y$ are related
by:

$$\Delta J_x\,\Delta J_y \geq \frac{1}{2}\,|\langle[J_x, J_y]\rangle| = \frac{\hbar}{2}\,|\langle J_z\rangle| \tag{8.41}$$
Another variant is obtained as follows. The observables $X$ and $P$ are in many ways
related. This is why they are sometimes called 'conjugate quantities'. Another pair of
quantities that display such a 'paired' behavior in many situations are time and energy.
This inspired some physicists to write

$$\Delta E\,\Delta t \geq \frac{\hbar}{2} \tag{8.42}$$

This relation is a bit more subtle though, as the quantity time ($t$) is not an observable, it
is a parameter. Hence, the above equation does not make immediate sense. However, in
the context of particle physics and decay processes it turns out that a similar relation does
actually hold, in the following way. If a system decays from one state to another, then
the typical time after which this decay occurs and the uncertainty of the energy released
are indeed related by the above inequality. To understand this better: an unstable state
typically does not have a very sharply defined energy, but rather a small range of energies.
This means the energy released (energy of the initial unstable state minus the final energy)
is not always the same, and has a small uncertainty ($\Delta E$), called the natural linewidth.
It turns out that the typical time of existence of an unstable state (denoted $\Delta t$) and the
linewidth ($\Delta E$) are indeed related by the above formula. This means that the more unstable a
state is (the faster it decays), the bigger the linewidth is. Conclusion: the above uncertainty
relation has some experimental meaning, but you should be a bit careful with the meaning
of $\Delta E$ and $\Delta t$ there. (Note that it is not just another application of the general uncertainty
relation: time ($t$) is not an observable, it's a parameter!)
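As a rough numerical illustration of the relation between lifetime and linewidth, the
sketch below evaluates the lower bound $\Delta E \geq \hbar/(2\Delta t)$. The function name and the
example lifetime of $10^{-8}$ s are hypothetical choices made for this illustration; they are
not taken from the text.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV*s

def min_linewidth_ev(lifetime_s):
    """Lower bound on the natural linewidth (in eV) for a given lifetime (in s)."""
    return HBAR_EV_S / (2.0 * lifetime_s)

example_lifetime = 1.0e-8  # hypothetical lifetime in seconds
width = min_linewidth_ev(example_lifetime)
width_shorter = min_linewidth_ev(example_lifetime / 2)
print(width, width_shorter)
```

Halving the lifetime doubles the bound on the linewidth: a state that decays faster has a
less sharply defined energy.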

Application: exotic atoms. 

With the uncertainty principle in our hands, we can understand the structure of atoms a
bit better. Think again of the hydrogen system. The Hamiltonian was

$$H = V(r) + \frac{P^2}{2m} = -\frac{\kappa e^2}{r} - \frac{\hbar^2}{2m}\Delta \tag{8.43}$$

Imagine a wave packet needs to be put in the lowest energy state possible. You know
the outcome of this problem, but let us ignore this for a moment. If a packet wants to
have a low potential energy (first term of the Hamiltonian), it needs to be well localized
around the nucleus (around $r = 0$). Such a packet will have a rather small $\Delta X$. However,
from the uncertainty principle, we know this implies $\Delta P$ is large. This means that (when
viewed in the momentum perspective) the wave packet necessarily contains components
of large momentum $P$. (Otherwise it can't have a large momentum variance.) This in
turn is disadvantageous for the total energy, as it blows up the kinetic part $P^2/2m$. It
is precisely a balancing between the two extremes that gives an atom a finite size: the
optimal way in between. A question one can ask is the following: what if the electron was
heavier? What would this change in the story? Well, as the mass of a particle stands
in the denominator of the kinetic term, a larger mass means the relative cost of kinetic
energy gets lower, at least if one fixes the total momentum.¹ So taking the same
wave function (hence the same $P^2$) but for a particle with a higher mass $m$ gives less
kinetic energy $P^2/2m$. In such a case, the particle might just as well sit a bit closer to the
nucleus, lowering its potential energy, while not driving up its kinetic energy too much.
Based on this reasoning, one expects a 'heavier electron' (if such existed) to sit closer to
the nucleus, and hence to be better bound. Recalling the expression for the Bohr radius
and the binding energy of the lowest energy state ($n = 1$),

Figure 8.3: A muonic hydrogen atom is almost identical to an ordinary hydrogen atom.
The only difference is that the electron is replaced by its heavier brother, the muon. This
particle sits in much smaller wave functions, and is much more strongly bound to the nucleus.

$$E(n = 1) = -\frac{m_e \kappa^2 e^4}{2\hbar^2} \tag{8.44}$$

$$a_0 = \frac{\hbar^2}{m_e\,\kappa e^2} \tag{8.45}$$

we see that the binding energy is indeed proportional to $m_e$, and the Bohr radius $a_0$ is
inversely proportional to it. This confirms our reasoning.

You may wonder if all this philosophizing is of any use. After all, the electron mass is 
just a fixed number, there is no way of changing it. Well, here is a reason why we would 
actually like to think about it. Apart from the particles you already know (electron, 
proton, neutron and photon) there are quite some other, more rare particles. They are 
generally short lived (getting created in some very special processes like particle collisions 
and nuclear decay) and rapidly decay to the more familiar particles. An example is the 
muon. It lives for about 2 microseconds, weighs 200 times as much as the electron, but 
has all other characteristics identical to the electron. It is actually possible to bind such 
a muon to a nucleus (during its short lifetime), forming a so-called muonic atom. The
energy levels and atomic size are quite well described by the above two formulas, provided
one puts in a mass 200 times the electron mass. This confirms both the above reasoning,
and that the muon nicely obeys the Schrödinger equation, like all other particles!

¹ This is just the same statement as the fact that in classical mechanics, the kinetic energy $p^2/2m$
decreases with increasing mass, if one fixes the momentum. Of course this is something very different
from keeping the velocity fixed while decreasing the mass. In that case, the kinetic energy $mv^2/2$
decreases, as expected.

Figure 8.4: The $\mathrm{H}_2^+$ molecule. (a): Two nuclei far apart (no lowering of the energy). (b):
Two nuclei close together (strong repulsion). (c): Intermediate distance: lowest energy
state (bound system).
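To see the size of the effect for the muonic atom, the two formulas for the ground state
energy and the Bohr radius can be evaluated for the electron mass and for a mass 200 times
larger. This is only a rough sketch: the function names are illustrative, the constants are
standard SI values, and (as in the formulas above) the nucleus is treated as infinitely heavy,
so reduced-mass corrections are ignored.

```python
HBAR = 1.054571817e-34      # reduced Planck constant, J*s
M_E = 9.1093837015e-31      # electron mass, kg
E_CHARGE = 1.602176634e-19  # elementary charge, C
KAPPA = 8.9875517923e9      # Coulomb constant 1/(4 pi eps0), N*m^2/C^2

def bohr_radius(mass):
    """Bohr radius hbar^2 / (m kappa e^2) in metres."""
    return HBAR**2 / (mass * KAPPA * E_CHARGE**2)

def binding_energy_ev(mass):
    """Ground state binding energy m kappa^2 e^4 / (2 hbar^2), converted to eV."""
    return mass * KAPPA**2 * E_CHARGE**4 / (2 * HBAR**2) / E_CHARGE

heavy = 200 * M_E  # the 'heavier electron' used in the text

print(bohr_radius(M_E), binding_energy_ev(M_E))      # ordinary hydrogen
print(bohr_radius(heavy), binding_energy_ev(heavy))  # 200 times heavier particle
```

The heavier particle sits 200 times closer to the nucleus and is bound 200 times more
strongly, in line with the reasoning above.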

Application: what the universe is made of.

With a reasoning similar to the one above, we can also understand the nature of chemical
bonds a bit better. Try to think of the simplest molecule: $\mathrm{H}_2^+$. This consists of two
hydrogen nuclei, and one electron 'circling around' and binding the system together. How
does the electron manage to keep the system together, and how does the system get its
size? If you look at Figure 8.4 you see that there are three players in the game. First,
there is the attraction of the electron to both nuclei. Second, there is the mutual repulsion
between the two nuclei. Last, there is the uncertainty principle, or better: the electron
wants to minimize its kinetic energy. If the two nuclei are too close together, their mutual
repulsion will drive them apart. If they are far away from each other, the electron might just
as well sit on one of the nuclei only, and the system breaks apart. However, if the two nuclei
are at an intermediate distance, something interesting happens. In the first place, the
electron can orbit around both nuclei, getting a low potential energy. In the second place,
as the wave function of the electron is now larger, $\Delta X$ is big, and the uncertainty principle
allows a very small $\Delta P$. In other words: the electron can take on a very narrow range of
low momenta, thus having only a very small kinetic energy. By 'taking on low momenta',
we mean that $\tilde\psi(p)$ is only nonzero for very small $p$, which necessarily corresponds to small
$\Delta P$. This system has a lower energy than any other option, meaning it forms
a genuine bound state. This is what keeps the molecule together. The very same effects
are responsible for the structure of any other molecule, even if more electrons and more
nuclei are involved. A rule of thumb is that electrons like to occupy a lot of space, taking
on low momenta, while combining this with a wave function close enough to the nuclei.
This is precisely what a chemical bond is. These bonds make molecules what they are,
and hence give structure to what most of the (known) universe is made of.
Exercises

1. Explain to someone what the uncertainty principle is. What are the implications for
measurements? How is it related to commutativity?

2. By using linearity, show that $\mathrm{Var}(x) = \langle X^2\rangle - \langle X\rangle^2$. Since the variance is a non-negative
quantity (it's the expectation value of a square) this implies $\langle X^2\rangle \geq \langle X\rangle^2$. Check
this property for the wave function $\psi(x)$ which equals one on the interval $[-1/2, 1/2]$
and equals zero outside that interval.

3. For a particle in a box, $L$ gives a vague idea of the position uncertainty. How large do
you estimate the momentum uncertainty to be then? What is the (classical) energy
of a particle with a momentum lying somewhere around that value? Compare this
to the expression for the energy level that we found. (Just the form, not the exact
prefactors - this exercise is only a vague estimate.)

4. A team of experimentalists can prepare atoms in an excited state. These atoms decay
to a stable state, with a lifetime of 10~^ s. What is the uncertainty of the energy of 
the photons produced in their experiment? 

5. In the last exercise of the previous chapter, you computed the Fourier transform
of the rectangular wave functions $\psi_n(x)$. (You should have gotten something
proportional to a sine function.) Estimate roughly the variance of both $\psi_n$ and its
transform (i.e. give a number characterizing the magnitude of this variance) and
verify that again $\Delta X\,\Delta P \sim \hbar$.

6. Compute $\Delta X$ and $\Delta P$ for the first excited level of the harmonic oscillator. (Hint:
express $X$ and $P$ in terms of creation and annihilation operators. Be careful with
prefactors like in (4.28) and (4.29)!) Do these satisfy the uncertainty principle?
What about the ground state? (Look at its wave function; it has a special form!)
Chapter 9

Reflection and transmission of 
particles 

In this chapter... 

The chapters of this Part 3 will study different kinds of systems than what we have met so
far. If you think about it, all the examples we have done considered particles which were
trapped in a potential. In order of appearance: a box potential, an oscillator potential,
and the Coulomb potential of the hydrogen atom. If a particle is not trapped/bound in a
potential, we call it a free particle. This chapter deals precisely with such free particles.
We will see that, just like trapped particles, they have some special and unexpected
properties. One of them is the phenomenon of tunnelling, where a particle can move through
a 'wall' (potential barrier) which according to classical mechanics would be impenetrable.
In the first part, we prepare ourselves a bit: we study the SE in potentials which
consist of several constant regions and we introduce the notion of probability current.
Then, we explain the phenomenon of tunnelling and why it is relevant in nature. Finally,
we compute the probability for a particle to actually tunnel through a potential barrier.
9.1 Particles in constant potentials

For this chapter, we will study particles in potentials, which are composed of several 
regions in which the potential is constant. To understand the behavior of such systems, 
we need to solve the SE in each part separately, and then glue together the solutions. 
Here, we will tackle the first part of that problem: solving the SE in a region where the 
potential is constant. 

So imagine a particle with total energy $E$, traveling through a region where the
potential is constant: $V(x) = V$. Because the particle has energy $E$, we have

$$H|\psi\rangle = E|\psi\rangle \tag{9.1}$$

with $\psi(x)$ the wave function of the state $|\psi\rangle$. This means that

$$H\psi(x) = -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V\psi(x) = E\psi(x), \tag{9.2}$$

so

$$\frac{d^2\psi}{dx^2} = -\frac{2m(E - V)}{\hbar^2}\,\psi(x). \tag{9.3}$$

There are two possibilities. Either the value of the potential is lower than the total energy 
of the particle, or it is higher. We treat those cases separately. 

Case 1: $V < E$

In this case, $E - V$ is positive, and we define a real and positive parameter $k$:

$$k^2 = \frac{2m(E - V)}{\hbar^2} \tag{9.4}$$

With this definition equation (9.3) becomes:

$$\frac{d^2\psi}{dx^2} = -k^2\,\psi(x) \tag{9.5}$$

This is a differential equation that we have met before. We can write the solution as a
sum of a sine and a cosine, or in terms of complex exponentials. Here we will choose
the latter, so we write the general solution of the above equation as

$$\psi(x) = Ae^{ikx} + Be^{-ikx} \tag{9.6}$$

with $A$ and $B$ complex numbers. (If you use $e^{ix} = \cos x + i\sin x$, the above expression can
be turned into a sum of a sine and a cosine again, albeit with different coefficients.) This
means the full, time dependent solution is given by:

$$\Psi(x, t) = \psi(x)\,e^{-iEt/\hbar} = Ae^{i(kx - \omega t)} + Be^{-i(kx + \omega t)} \tag{9.7}$$
with $\omega = E/\hbar$. This is just the sum of two waves, one traveling to the left, and the other
traveling to the right. Using

$$kx - \omega t = c \quad\Leftrightarrow\quad x = (c/k) + (\omega/k)\,t \tag{9.8}$$

we see that the wave of amplitude $A$ is traveling to the right, with velocity $v = \omega/k$, and
the wave with amplitude $B$ is traveling to the left, with velocity $v = -\omega/k$.

The interpretation of this should be quite clear: if a particle has enough energy to be
in a certain region (so $E > V$), then the solutions are just waves. By superposition of
such waves, we can build wave packets traveling around in such a region.

Case 2: $V > E$

Here, it is $V - E$ which is positive, so we define a real and positive parameter $\kappa$ as follows:

$$\kappa^2 = \frac{2m(V - E)}{\hbar^2} \tag{9.9}$$

Equation (9.3) now becomes:

$$\frac{d^2\psi}{dx^2} = \kappa^2\,\psi(x) \tag{9.10}$$

The solutions are now:

$$\psi(x) = Ae^{\kappa x} + Be^{-\kappa x} \tag{9.11}$$

with $A$ and $B$ again complex numbers. These solutions are not ordinary waves anymore,
but represent wave functions that are either damped or blowing up for large $x$. This
means something special is happening. Anyhow, you might have expected something special
to happen: if in a certain region $V > E$, it means that the particle does not have enough
energy to be at that place, according to classical mechanics. So it is not strange that we
do not find any ordinary wave solutions here anymore. This behavior will be explained
later in this chapter.
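The two cases can be summarized in a small helper that, for a given energy and potential,
tells which kind of solution applies and returns the corresponding parameter. This is a
minimal sketch: the function name `wave_parameter`, the choice of an electron as the
particle and the example energies are assumptions made for illustration.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J*s
M_E = 9.1093837015e-31  # electron mass, kg
EV = 1.602176634e-19    # one electronvolt in joule

def wave_parameter(energy_ev, potential_ev, mass=M_E):
    """Return ('oscillating', k) if E > V, or ('decaying', kappa) if V > E, in 1/m."""
    diff = (energy_ev - potential_ev) * EV
    if diff == 0:
        raise ValueError("E = V: neither case applies")
    value = math.sqrt(2 * mass * abs(diff)) / HBAR
    return ("oscillating" if diff > 0 else "decaying"), value

# Hypothetical example: an electron with E = 1 eV in regions with V = 0 eV and V = 2 eV
kind_low, k = wave_parameter(1.0, 0.0)
kind_high, kappa = wave_parameter(1.0, 2.0)
print(kind_low, k)
print(kind_high, kappa, 1 / kappa)  # 1/kappa: length scale of the damping
```

For an electron that is 1 eV short of the potential, the damping length $1/\kappa$ comes out at
roughly $2 \times 10^{-10}$ m, about the size of an atom.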

9.1.1 Probability current 

Another thing we will use in this chapter is the notion of probability current. Typically,
in a system where there is a density $\rho$ in play (energy density, particle density, ...) people
like to write an equation of the following form:

$$\frac{\partial\rho}{\partial t} = -\nabla\cdot j = -(\partial_x j_x + \partial_y j_y + \partial_z j_z) \tag{9.12}$$

This is called the continuity equation. In just one dimension, it becomes:

$$\frac{\partial\rho}{\partial t} = -\partial_x j_x \tag{9.13}$$

This can easily be shown to imply that the total quantity $Q = \int_{-\infty}^{\infty} \rho(x)\,dx$ is conserved.
By conserved, we mean that it does not grow or shrink in the course of time: $\partial_t Q = 0$.
(Showing this needs the assumption $j_x(+\infty) = j_x(-\infty) = 0$.) The object $j(x)$ is
then called the conserved current associated to the density $\rho$. For example: if $\rho_E(x)$
represents an energy density, and the total energy $\int_{-\infty}^{\infty} \rho_E(x)\,dx$ of a system is conserved
(does not grow or shrink with time) then there will typically be a conserved current
$j_E(x)$ such that together with $\rho_E$ it satisfies the continuity equation. Similarly, if $\rho_p(x)$
is a particle density, and the total number of particles $\int_{-\infty}^{\infty} \rho_p(x)\,dx$ is conserved, then
there typically is some conserved current $j_p(x)$ in play. In quantum mechanics, there is a
continuity equation as well. Recall that $|\psi(x)|^2$ has the meaning of a probability density,
and since the total probability $\int_{-\infty}^{\infty} dx\,\rho(x) = \int_{-\infty}^{\infty} dx\,|\psi(x)|^2$ is constant (it equals one for
any time $t$) it is conserved. This means we can probably set up a continuity equation for
the probability density. Let us try to find it. By the product rule of derivatives,

$$\frac{\partial}{\partial t}|\psi|^2 = \psi^*\,\frac{\partial\psi}{\partial t} + \psi\,\frac{\partial\psi^*}{\partial t} \tag{9.14}$$

To proceed, let us consider a freely moving particle. This means we consider $V(x) = 0$.
In that case, the SE tells us that

$$\frac{\partial\psi}{\partial t} = \frac{i\hbar}{2m}\,\Delta\psi \tag{9.15}$$

and by taking the complex conjugate

$$\frac{\partial\psi^*}{\partial t} = -\frac{i\hbar}{2m}\,\Delta\psi^* \tag{9.16}$$

If this step is not obvious, try to write out $\psi$ as a sum of its real and imaginary part, write
out the derivatives, and then apply complex conjugation to arrive at the above expression.
Plugging these two equations into (9.14), we get

$$\frac{\partial}{\partial t}|\psi|^2 = \frac{i\hbar}{2m}\left(\psi^*\Delta\psi - \psi\Delta\psi^*\right) \tag{9.17}$$

$$= \frac{i\hbar}{2m}\,\nabla\cdot\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) \tag{9.18}$$

If the second step is not clear, write out the derivatives in components, like $\Delta\psi = \sum_i \partial_i^2\psi$
or $\nabla\cdot v = \sum_i \partial_i v_i$. Ah! The above equation is a continuity equation indeed, for $\rho = |\psi|^2$
and

$$j = \frac{\hbar}{2mi}\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) = \frac{\hbar}{m}\,\mathrm{Im}\left(\psi^*\nabla\psi\right) \tag{9.19}$$

The interpretation of $j$ is then as follows: if a wave packet is moving from one place to the
other, the probability is flowing from this one region to the other. The size of the flow at
each position $x$ is precisely given by $j(x)$.
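As a quick check of the expression for $j$, the current can be evaluated numerically for the
plane-wave solutions met earlier in this chapter. The sketch below works in one dimension
with a finite-difference derivative; the units ($\hbar = m = 1$), the amplitudes and the helper
name `probability_current` are assumptions made for this illustration.

```python
import numpy as np

hbar, m = 1.0, 1.0  # natural units (assumption)

def probability_current(x, psi):
    """j(x) = (hbar/m) Im(psi* dpsi/dx), with a finite-difference derivative."""
    dpsi = np.gradient(psi, x)
    return (hbar / m) * np.imag(np.conj(psi) * dpsi)

x = np.linspace(0.0, 10.0, 20001)
k, A, B = 2.0, 1.5, 0.5  # hypothetical wave number and amplitudes

# A single right-moving wave, and a superposition of right- and left-moving waves
j_right = probability_current(x, A * np.exp(1j * k * x))
j_both = probability_current(x, A * np.exp(1j * k * x) + B * np.exp(-1j * k * x))

print(j_right[1000], hbar * k / m * abs(A) ** 2)
print(j_both[1000], hbar * k / m * (abs(A) ** 2 - abs(B) ** 2))
```

For these waves the current is the same at every position: the right-moving wave carries
a flow $(\hbar k/m)|A|^2$, and the left-moving wave subtracts $(\hbar k/m)|B|^2$ from it.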
9.2 Tunnelling and nuclear decay



In this story, we will freshen up our knowledge about the nucleus. You know it consists of
protons and neutrons. You also know that these are bound together very tightly. You
may wonder how this is possible: after all, the protons are all positively charged, so they
repel each other. The secret (which you may or may not have heard of) is the so-called
strong force. This is a force between particles, just like electromagnetism and gravitation,
but with some special features. First of all: it has a very short range. As you know, the
influence of gravitation and electromagnetism can work over very large distances. Arbitrarily
large, in fact. Both forces have a $1/r^2$ behavior, which gets small for large $r$ but never
vanishes. However, for the strong force, this is not true. It can only work between particles
that are very close together, at a distance of about $10^{-15}$ m, approximately the size of the
nucleus of an atom. Another special feature is that this force is always attractive, and
only felt by protons and neutrons. That means electrons do not feel this force at all.
This is a bit similar to the fact that the electric force is not felt by the neutron (because
it is neutral). Lastly, this force is very strong, or better: within its small range, it
dominates the electromagnetic force by far. This in turn is a bit similar to the fact that
the electromagnetic force between individual particles is much, much larger than their
mutual gravitational force; that's why we can safely ignore gravitation when dealing with
small systems. To summarize:



Force:             Range:              Strength:       Affected particles:
Strong             ~ $10^{-15}$ m      very strong     $p^+$, $n^0$
Electromagnetic    $\infty$            intermediate    charged particles
Gravitational      $\infty$            very weak       all particles


With this information, you can understand better why a nucleus sticks together. If
neutrons and protons are very, very close together, they are within the range of each
other's attractive strong force. This force is much larger than the electrostatic repulsion
between the protons, so the nucleons stick together tightly and are well bound. This way,
they form a nucleus, and the size of that nucleus is of the same order as the range of the
strong force.

This sounds very cosy, but it turns out that if there are too many nucleons sitting
together, the nucleus gets crowded, and some of the nucleons might want to get out.
Dividing the nucleons over two nuclei might be energetically more advantageous than
packing them all together in a single crowded one. This is precisely what is known as
radioactivity. Typically, elements with too many nucleons (such as uranium, for example)
have the tendency to fall apart after a while.¹ By falling apart, they release some energy:
the energy of the initial state minus the energy of the final state. So in a large bunch of
atoms of an unstable kind, decays are occurring all the time. Each such decay releases
a bit of energy. It is this ongoing energy emission, in the form of radiation, that signals
that a chemical element is radioactive.

¹ Also some smaller nuclei can be unstable, but this is for other reasons.

Hopefully you now have a better understanding of what the nucleus is. However, 
there is one logical gap in the explanation. We have said that the strong force (within its 
range) is always bigger than the electrostatic repulsion. So it seems that once the protons 
and neutrons are locked up in a nucleus, they can never get out. Not even when the end 
product has a lower energy and thus is favorable. From this we would incorrectly conclude 
that there are no unstable nuclei. How can we understand this better? To answer this 
question, look at Figure 19.11 There, the total potential (of the electrostatic force + the 
strong force) is shown. If a nucleon was a point particle, it would just tend to move hence 
and forth in the potential well. Every time it reaches the boundary of the nucleus, it slows 
down, stops and moves back. So in this description, a single particle (or a cluster of them) 
can never escape, and we can not explain radioactivity. 

However, quantum mechanics tells us this description is wrong. We know that the 
protons and neutrons (just like electrons) are each described by a wave function, which 
has a nonzero size. Now the result of such a wave packet hitting a potential barrier is very 
interesting. There is always a small part of the wave function that crosses the barrier, even 
if the energy of the wave packet is much lower than the height of the potential barrier. 
This is the so-called tunnelling effect. It is shown in Figure 9.2. We will study this
effect quantitatively at the end of this chapter, and give an expression for the amount of
the wave packet that is let through the barrier. For now, knowing this effect is enough
to understand the puzzle here. Indeed, the tunnel effect implies there is always a small
chance for a particle to cross a barrier, and hence for one or more nucleons of an unstable
atom to leave the nucleus. This means nuclei can decay (at least: if the end result has a 
lower energy) even if the barrier to escape the nucleus is classically impenetrable. 

This quantum effect was first studied by George Gamow, who used the tunnel effect to 
describe the process of alpha decay. In alpha decay, two neutrons and two protons leave 
the nucleus at once. These four particles are exactly the same as a helium nucleus, and 
such a cluster is also called an 'alpha particle'. Using the tunnelling probability formula 
(which we will derive shortly) Gamow managed to explain a relation between the emitted
energy in the alpha decay process and the decay time. The decay time is the typical time
an unstable atom lives before decaying and depends on the specific isotope. This relation 
matches very well with experiments, confirming the quantum mechanical computation. 

Before we go to the calculations, two small remarks on the explanation above. First, 
you may wonder why we can describe the nucleus as a point when we are dealing with 
electrons in an atom. Indeed, in the hydrogen case we just took the nucleus to be a point 
charge. The above explanation suggests we should describe all particles in the nucleus 
by their wave function, not as a single point. This would obviously make the problem of 
studying electrons in an atom much harder. What saves the day is of course the strength of 
the strong force, which squeezes the wave functions of nucleons together very tightly. This 
makes the nucleus look extremely small - much smaller than the atom itself. A popular 
comparison is that the nucleus sitting in the atom is as small as an ant sitting in a







Figure 9.1: The total nuclear potential for a proton and a neutron. Both feel the attractive 
strong force. This creates the deep pit around the origin. On top of that, the proton 
experiences the repulsive coulomb potential. For both particles the typical energy lies 
below the height of the barrier, meaning they can not escape. 



large football stadium. That is why we can safely treat it as a single point when we are
describing the electrons. A second pitfall we want to prevent is a too literal interpretation
of Figure 9.2 in the context of the nucleus. Just like the wave functions in a box, the wave
functions of nucleons stretch out over the entire nucleus. They are not just small wave
packets bouncing around, so Figure 9.2 gives a somewhat simplistic image of this situation. For
cases where a free particle hits a potential barrier, this figure is precisely what happens, 
and it will be this situation which we will describe below. So the two situations (free 
particle hitting a potential barrier and an alpha particle leaving the nucleus) both involve 
the same tunnelling effect - although the situations are not strictly identical. Got it? 



9.3 Particles and barriers



We will study two problems in this part. The first one considers a particle, traveling 
freely, until at some point in space it encounters a sudden potential step. We will ask 
what happens to the particle. If it travels onwards (into the region where V is higher), we 
say the particle is transmitted. When it is sent back to where it came from, we say that
the particle has been reflected. We want to find out the relative probabilities for those
two options. The other problem deals with a potential barrier: a narrow but high bump
in the potential. Classically, the particle will be reflected by the bump if it is higher than
the particle's energy, but according to quantum mechanics, there will always be a nonzero
chance for the particle to boldly cross the barrier. This is precisely the tunnel effect we
were talking about before.

Footnote (on the ant-in-a-stadium comparison): The size of the total atom is of course determined by the approximate size of the cloud of electrons.

Figure 9.2: When a classical (point) particle hits a potential higher than its energy, it is
reflected. When a wave function hits a barrier, there is always a part which is transmitted.
This implies there is a nonzero chance for the particle to go through the barrier.



9.3.1 The potential step 

This describes a particle traveling in a potential that has a step at some place, say at
$x = 0$:

$$V(x) = \begin{cases} 0 & \text{for } x < 0 \\ V & \text{for } x > 0 \end{cases}$$

Also, let us take the energy E of the particle to be larger than V. We then know from the 
first part of the chapter, that the solution of the SE in each part of the potential is given 
by ordinary waves: 



$$\psi(x) = \begin{cases} \psi_1(x) = A_1 e^{i k_1 x} + B_1 e^{-i k_1 x} & \text{for } x < 0 \\ \psi_2(x) = A_2 e^{i k_2 x} + B_2 e^{-i k_2 x} & \text{for } x > 0 \end{cases}$$

with $k_1 = \sqrt{2mE}/\hbar$ and $k_2 = \sqrt{2m(E - V)}/\hbar$. The time evolution of this solution is just
given by multiplying the above wave functions by $e^{-iEt/\hbar}$. To have a healthy solution we
need to glue together the part $\psi_1$ on the left to the part $\psi_2$ on the right. In particular, to
have a decent wave function we need the function $\psi(x)$ to be continuous at $x = 0$. Also,
to ensure the wave function does not have a strange kink, we will demand the derivative
of $\psi$ to be continuous at $x = 0$ as well. These requirements give the following system of
equations:

$$\psi_1(0) = \psi_2(0) \;\Rightarrow\; A_1 + B_1 = A_2 + B_2 \tag{9.20}$$

$$\psi_1'(0) = \psi_2'(0) \;\Rightarrow\; k_1 (A_1 - B_1) = k_2 (A_2 - B_2) \tag{9.21}$$

Figure 9.3: The potential step. When a particle with energy E > V hits this barrier,
there is a chance R to be reflected and a chance T to be transmitted. This contrasts with
the description of particles as points. There a particle will always be transmitted when
E > V.

Now some more physical input. The solution for $\psi(x)$ above contains four parts: two waves in the
region $x < 0$ (one going to the left and one to the right), and similarly two waves in the
region $x > 0$. We want to describe a particle coming from the left, falling on the potential
step, and check how much of the initial wave is reflected and how much is transmitted.
This means that in the region $x > 0$ we only want to consider waves that are going to the
right. Hence we will put

$$B_2 = 0. \tag{9.22}$$

Also, we only want to study the ratios of the transmitted and reflected waves compared to 
the incoming wave. This means we can just normalize with respect to the incoming wave, 
and put the amplitude of the incoming wave to one: 

$$A_1 = 1. \tag{9.23}$$

The conditions (9.20) and (9.21) then become

$$1 + B_1 = A_2 \quad \text{and} \quad k_1 (1 - B_1) = k_2 A_2$$

which has as solutions

$$A_2 = \frac{2 k_1}{k_1 + k_2}, \qquad B_1 = \frac{k_1 - k_2}{k_1 + k_2}. \tag{9.24}$$
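If you do not feel like solving the two matching conditions by hand, a computer can do it for you. The sketch below is only an illustration: the function name `step_amplitudes`, the choice of units ($\hbar = m = 1$) and the values of E and V are our own assumptions, not part of the derivation above. It solves the linear system numerically and compares with the closed-form solution (9.24).

```python
import numpy as np

def step_amplitudes(E, V, m=1.0, hbar=1.0):
    """Solve the matching conditions at x = 0 for A1 = 1 and B2 = 0.

    Returns (k1, k2, B1, A2). Requires E > V.
    """
    k1 = np.sqrt(2.0 * m * E) / hbar
    k2 = np.sqrt(2.0 * m * (E - V)) / hbar
    # Unknowns are (B1, A2). The two conditions, rearranged:
    #   1 + B1 = A2          ->  -B1     + A2      = 1
    #   k1 (1 - B1) = k2 A2  ->  k1 * B1 + k2 * A2 = k1
    matrix = np.array([[-1.0, 1.0], [k1, k2]])
    rhs = np.array([1.0, k1])
    B1, A2 = np.linalg.solve(matrix, rhs)
    return k1, k2, B1, A2

# Illustrative values (assumed): a particle with E = 2 on a step of height V = 1.
k1, k2, B1, A2 = step_amplitudes(E=2.0, V=1.0)
print("B1 numeric:", B1, " closed form:", (k1 - k2) / (k1 + k2))
print("A2 numeric:", A2, " closed form:", 2 * k1 / (k1 + k2))
```

Both pairs of numbers should agree up to rounding errors.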
Wave packets

It should be clear that the coefficients $A_2$ and $B_1$ are closely related to the probabilities
for a particle to be transmitted or reflected by the barrier. If $A_2$ is large, a large portion
of an incident wave will be going into the region $x > 0$, which can be interpreted as a large
chance for the particle to be transmitted. 

However, real particles are never an infinite wave, but a limited packet. (Recall that we
described an infinite wave as a mere caricature of a wave packet of which the momentum is
sharply peaked around some value.) But we know from the chapter on Fourier transforms
that such a packet can nevertheless be described by a superposition of infinite waves. So
we only need a small step to convert the above results into a statement about particles.
Say we start out with a wave packet $\psi(x)$ that is initially located in some region on the
side $x < 0$, and that is traveling to the right. In momentum representation, we can
write

$$\psi(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} dk_1\, e^{i k_1 x}\, \tilde{\psi}(k_1). \tag{9.25}$$

In this last equation, we wrote the waves $e^{i k_1 x}$ in terms of the wavenumber, and not in terms
of the momentum $p = \hbar k$. This means we do not have to include any extra factors $\hbar$
anywhere: we can just use the original expression for a Fourier transform which we met
in the first section of Chapter 8. The normalization condition phrased in terms of $\tilde{\psi}(k_1)$
is

$$\int_{-\infty}^{\infty} dk_1\, |\tilde{\psi}(k_1)|^2 = 1. \tag{9.26}$$



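The previous section showed that a wave $e^{i k_1 x}$ which falls on the potential step is broken
up in two parts:

$$e^{i k_1 x} \;\to\; B_1 e^{-i k_1 x} + A_2 e^{i k_2 x}. \tag{9.27}$$

(We suppress the time evolution for elegance.) This is true for all $k_1$, so the wave packet
(9.25) in its entirety will break up into

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} dk_1\, B_1 e^{-i k_1 x}\, \tilde{\psi}(k_1) + \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} dk_1\, A_2 e^{i k_2 x}\, \tilde{\psi}(k_1). \tag{9.28}$$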

What we want to do, is to find the amplitudes of the two parts. First, let us consider the
reflected part. Its total amplitude (= the norm squared) will be interpreted as the chance
for reflection:

$$R = \int_{-\infty}^{\infty} dk_1\, |B_1|^2\, |\tilde{\psi}(k_1)|^2. \tag{9.29}$$

Similarly the total amplitude of the transmitted part can be interpreted as the chance for 
the particle to go over the potential step. We compute the values of R and T below. 
Finding R and T

We start with the reflected part. To do the integral (9.29), note that $B_1$ depends on $k_1$,
so it is not just a constant. However, things simplify if we assume that the initial wave
function $\tilde{\psi}(k_1)$ is sharply peaked around a single value (say $\bar{k}_1$) and zero at other places.
Then the relevant range of integration in the above integral becomes a very small region
around $\bar{k}_1$, and we can take $B_1$ to be approximately constant: namely, just its value when
evaluated for $\bar{k}_1$. Using this, and the normalization (9.26), we get

$$R = |B_1|^2 \int_{-\infty}^{\infty} dk_1\, |\tilde{\psi}(k_1)|^2 = |B_1|^2. \tag{9.30}$$

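As said, it is understood that $B_1$ is evaluated at the peak value $\bar{k}_1$ of the incident wave
function. Now let us consider the transmitted part, the last term in (9.28). Let us denote
that wave packet by $\psi_T(x)$. Because of the relation between $k_1$ and $k_2$

$$k_1^2 - k_2^2 = \frac{2Em}{\hbar^2} - \frac{2(E - V)m}{\hbar^2} = \frac{2Vm}{\hbar^2} \tag{9.31}$$

we can change the integration variable to $k_2$, and get

$$\int_{-\infty}^{\infty} dk_1\, A_2 e^{i k_2 x}\, \tilde{\psi}(k_1) = \int_{-\infty}^{\infty} dk_2\, \frac{\partial k_1}{\partial k_2}\, A_2 e^{i k_2 x}\, \tilde{\psi}(k_1(k_2)). \tag{9.32}$$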

In the object on the right hand side, the integration variable ($k_2$) is the same object
as what occurs in the exponentials - unlike the integral on the left. So the momentum
representation $\tilde{\psi}_T(k_2)$ is to be read off from the integral on the right. Using (9.31) we can
see that $\frac{\partial k_1}{\partial k_2} = \frac{k_2}{k_1}$ and we find

$$\tilde{\psi}_T(k_2) = \frac{k_2}{k_1}\, A_2\, \tilde{\psi}(k_1) \tag{9.33}$$

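where $k_1$ depends on $k_2$ via (9.31). With this, the total amplitude of the transmitted part
becomes

$$T = \int_{-\infty}^{\infty} dk_2\, \frac{k_2^2}{k_1^2}\, |A_2|^2\, |\tilde{\psi}(k_1)|^2 = \int_{-\infty}^{\infty} dk_1\, \frac{k_1}{k_2}\, \frac{k_2^2}{k_1^2}\, |A_2|^2\, |\tilde{\psi}(k_1)|^2. \tag{9.34}$$

Now again we use the fact that $\tilde{\psi}(k_1)$ is sharply peaked, so we can approximate the objects
inside the integral by using the value $\bar{k}_1$ and the corresponding $\bar{k}_2$:

$$T = \frac{\bar{k}_1}{\bar{k}_2}\, \frac{\bar{k}_2^2}{\bar{k}_1^2}\, |A_2|^2 \int_{-\infty}^{\infty} dk_1\, |\tilde{\psi}(k_1)|^2 = \frac{\bar{k}_2}{\bar{k}_1}\, |A_2|^2. \tag{9.35}$$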

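where $A_2$ is to be evaluated at $\bar{k}_1$ and $\bar{k}_2$. In conclusion, the chances for our incident
particle to be reflected or transmitted, are given by:

$$R = |B_1|^2 = \left(\frac{k_1 - k_2}{k_1 + k_2}\right)^2, \qquad T = \frac{k_2}{k_1}\, |A_2|^2 = \frac{4 k_1 k_2}{(k_1 + k_2)^2}. \tag{9.36}$$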
Getting here was a bit nasty, but as a rather non-trivial check, you can verify that these
probabilities nicely add up to one:

$$R + T = 1. \tag{9.37}$$

Two more remarks to wrap this all up. First: you may wonder why we changed integration
variables twice when deriving $T$. Once from $k_1$ to $k_2$ and once from $k_2$ to $k_1$. Is this really
necessary? Couldn't we just have left $k_1$ there all the time? The answer is no. We had
to switch to $k_2$ to correctly read off the wave function $\tilde{\psi}_T(k_2)$, since the waves in the
integrand were expressed in terms of $e^{i k_2 x}$ and not $e^{i k_1 x}$. The second change of variables
was there to pull out the integral $\int dk_1\, |\tilde{\psi}(k_1)|^2 = 1$ again, like we did in the reflection
part. A second comment is about time dependence. We have at no point written any of the
$e^{-iEt/\hbar}$ factors. Is that ok? Well, since we were considering waves with different $k$, these
also have different energies $E$. If the wave function $\tilde{\psi}(k)$ is sharply peaked around one $k$,
the energies of the different components are very comparable, and the time dependence
really is just an irrelevant factor $e^{-iEt/\hbar}$ showing up everywhere. So no, it does not change
much if we include this factor or not, but yes, to be correct we should have. This would
introduce some phases that are present in the outgoing wave packets, but which of course
do not change the norms, and hence not the outcome. The main goal of this derivation
was not to include all the details, but we did manage to find expressions for $R$ and $T$.
And as we anticipated, they are just proportional to $|B_1|^2$ and $|A_2|^2$.
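The 'sharply peaked' approximation can be checked numerically. The sketch below is a minimal illustration under our own assumptions: units with $\hbar = m = 1$, a Gaussian shape for $|\tilde{\psi}(k_1)|^2$, and a hypothetical helper `packet_R_T`. It integrates the reflected and transmitted parts over the packet and compares $R$ with its value at the peak.

```python
import numpy as np

def packet_R_T(k_bar, sigma, V, m=1.0, hbar=1.0, n=20001):
    """Integrate R and T over a narrow Gaussian packet centred at k_bar."""
    k1 = np.linspace(k_bar - 8 * sigma, k_bar + 8 * sigma, n)
    dk = k1[1] - k1[0]
    # |psi~(k1)|^2, normalised so that its integral over k1 equals one.
    weight = np.exp(-((k1 - k_bar) ** 2) / (2 * sigma**2))
    weight /= weight.sum() * dk
    # k2 follows from k1^2 - k2^2 = 2 m V / hbar^2 (every component needs E > V).
    k2 = np.sqrt(k1**2 - 2 * m * V / hbar**2)
    B1 = (k1 - k2) / (k1 + k2)
    A2 = 2 * k1 / (k1 + k2)
    R = np.sum(B1**2 * weight) * dk
    T = np.sum((k2 / k1) * A2**2 * weight) * dk
    return R, T

# Illustrative values (assumed): peak at k_bar = 2, small width, step height V = 1.
k_bar, sigma, V = 2.0, 0.02, 1.0
R, T = packet_R_T(k_bar, sigma, V)
k2_bar = np.sqrt(k_bar**2 - 2 * V)          # uses hbar = m = 1
R_peak = ((k_bar - k2_bar) / (k_bar + k2_bar)) ** 2
print("R from packet:", R, " R at peak:", R_peak)
print("R + T:", R + T)
```

The narrower you make `sigma`, the closer the packet result gets to the peak value, while $R + T$ stays equal to one for any width (as long as all components have $E > V$).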

Probability currents 

The above construction of wave packets has two faces. On one hand, it makes clear what 
is happening: a wave packet gets split and the amplitude of both parts precisely give the 
probability for transmission or reflection to occur. On the other hand, it is a bit of a long
way round to get the end result. Is there a shorter way to obtain $T$ and $R$? You bet. The
tool we can use is the probability current we met in the beginning of this chapter. The
current on the left side ($x < 0$) and right side ($x > 0$) are:



$$j_1(x) = \frac{\hbar}{m}\, \mathrm{Im}\left(\psi_1^*(x)\, \frac{\partial \psi_1(x)}{\partial x}\right) = \frac{\hbar k_1}{m}\left(1 - |B_1|^2\right) \tag{9.38}$$

$$j_2(x) = \frac{\hbar}{m}\, \mathrm{Im}\left(\psi_2^*(x)\, \frac{\partial \psi_2(x)}{\partial x}\right) = \frac{\hbar k_2}{m}\, |A_2|^2 \tag{9.39}$$

The current $j_1$ consists of two parts. The positive piece $j_1^+ = \frac{\hbar k_1}{m}$ corresponds to a
probability current moving to the right. This is the incoming wave. The negative piece
$j_1^- = -\frac{\hbar k_1}{m} |B_1|^2$ is the reflected part. The current $j_2$ is just the probability current
corresponding to the transmitted wave. Relative to the incoming current $j_1^+$, the transmitted
and reflected currents are

$$T = \frac{j_2}{j_1^+} = \frac{k_2}{k_1}\, |A_2|^2 = \frac{4 k_1 k_2}{(k_1 + k_2)^2} \tag{9.40}$$

$$R = \frac{|j_1^-|}{j_1^+} = |B_1|^2 = \left(\frac{k_1 - k_2}{k_1 + k_2}\right)^2 \tag{9.41}$$

This confirms our previous results via a nice shortcut. Nevertheless, the interpretation
of $T$ and $R$ remains most clear in the wave packet view. We just bear in mind that infinite
waves are not fully physical (not normalizable) - but the wave packets constructed from
them are.
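The current shortcut is easy to verify on a computer. In the sketch below (an illustration only: the units $\hbar = m = 1$ and the values of E and V are assumptions) we evaluate $j = \frac{\hbar}{m}\mathrm{Im}(\psi^* \partial_x \psi)$ at a few points on both sides of the step, using the plane-wave solution with $A_1 = 1$ and $B_2 = 0$.

```python
import numpy as np

hbar = m = 1.0          # assumed units
E, V = 2.0, 1.0         # illustrative values with E > V
k1 = np.sqrt(2 * m * E) / hbar
k2 = np.sqrt(2 * m * (E - V)) / hbar
B1 = (k1 - k2) / (k1 + k2)
A2 = 2 * k1 / (k1 + k2)

def current(psi, dpsi_dx):
    """Probability current j = (hbar/m) Im(psi* dpsi/dx)."""
    return (hbar / m) * np.imag(np.conj(psi) * dpsi_dx)

# Left of the step: incoming plus reflected wave.
x_left = np.linspace(-5.0, 0.0, 11)
psi1 = np.exp(1j * k1 * x_left) + B1 * np.exp(-1j * k1 * x_left)
dpsi1 = 1j * k1 * (np.exp(1j * k1 * x_left) - B1 * np.exp(-1j * k1 * x_left))
j1 = current(psi1, dpsi1)

# Right of the step: transmitted wave only.
x_right = np.linspace(0.0, 5.0, 11)
psi2 = A2 * np.exp(1j * k2 * x_right)
dpsi2 = 1j * k2 * psi2
j2 = current(psi2, dpsi2)

j_in = hbar * k1 / m                 # incoming current j1^+
T = j2[0] / j_in
R = (j_in - j1[0]) / j_in            # |j1^-| / j1^+
print("T =", T, " R =", R, " R + T =", R + T)
```

The current comes out the same at every point, left and right of the step. That is just probability conservation, and it is the reason why $R + T = 1$.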

A last comment on the result of the computation. Note that (unlike what you would 
guess from classical mechanics) there is always some nonzero chance for reflection. So 
even if a particle can easily cross a potential step, there is a (small) chance for it to be 
bounced right back. There are several experimental situations where particles meet a 
small potential step, typically when particles are traveling from one type of material into 
another one. Even if this step is small enough for all particles to pass (given their energy)
there are some which are reflected - in accordance with the above expression.

9.3.2 The potential barrier 

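We now consider a potential barrier:

$$V(x) = \begin{cases} V & \text{for } -a < x < a \\ 0 & \text{everywhere else} \end{cases}$$

and a particle coming in from the left. The solution of the SE in the three regions is:

$$\psi(x) = \begin{cases} \psi_1 = A_1 e^{i k x} + B_1 e^{-i k x} & \text{for } x < -a \\ \psi_2 = A_2 e^{-\mu x} + B_2 e^{\mu x} & \text{for } -a < x < a \\ \psi_3 = A_3 e^{i k x} & \text{for } x > a \end{cases} \tag{9.42}$$

with

$$k = \frac{\sqrt{2mE}}{\hbar}, \qquad \mu = \frac{\sqrt{2m(V - E)}}{\hbar}. \tag{9.43}$$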

Demanding continuity of $\psi$ and its derivative at $x = -a$ and $x = a$ amounts to

$$\psi_1(-a) = \psi_2(-a) \;\Rightarrow\; A_1 e^{-i k a} + B_1 e^{i k a} = A_2 e^{\mu a} + B_2 e^{-\mu a} \tag{9.44}$$

$$\psi_1'(-a) = \psi_2'(-a) \;\Rightarrow\; i k A_1 e^{-i k a} - i k B_1 e^{i k a} = -\mu A_2 e^{\mu a} + \mu B_2 e^{-\mu a} \tag{9.45}$$

$$\psi_2(a) = \psi_3(a) \;\Rightarrow\; A_2 e^{-\mu a} + B_2 e^{\mu a} = A_3 e^{i k a} \tag{9.46}$$

$$\psi_2'(a) = \psi_3'(a) \;\Rightarrow\; -\mu A_2 e^{-\mu a} + \mu B_2 e^{\mu a} = i k A_3 e^{i k a} \tag{9.47}$$

This is a homogeneous system of 4 equations with 5 unknown parameters $A_1$, $B_1$, $A_2$, $B_2$
and $A_3$. We will put $A_1 = 1$, just like we did for the potential step. This gives 4 equations
and 4 unknown parameters. Solving for $A_3$ yields

$$|A_3|^2 = \frac{(2k/\mu)^2}{(1 - k^2/\mu^2)^2 \sinh^2(2\mu a) + (2k/\mu)^2 \cosh^2(2\mu a)} \tag{9.48}$$

The currents on the left and right are now

$$j_1 = \frac{\hbar k}{m}\left(1 - |B_1|^2\right) = j_1^+ + j_1^- \tag{9.49}$$

$$j_3 = \frac{\hbar k}{m}\, |A_3|^2 \tag{9.50}$$



so we see that $|A_3|^2$ ($= j_3 / j_1^+$) is the transmission probability. Strikingly, even if
$E < V$ (which is implicitly expressed by the reality of $\mu$) there is a nonzero chance for the
particle to cross the barrier. In the limit $\mu a \gg 1$ (broad barrier, large energy deficit) the
transmission probability (9.48) becomes

$$|A_3|^2 \approx 16\, e^{-4\mu a} \left(\frac{k \mu}{k^2 + \mu^2}\right)^2 = 16\, e^{-4\mu a}\, \frac{E}{V}\left(1 - \frac{E}{V}\right) \tag{9.51}$$

so the transmission probability decreases exponentially with the barrier width $2a$. The
incoming, reflected, and transmitted probability currents are shown on the figure below:







Figure 9.4: The incoming, reflected, and transmitted probability currents at a potential 
barrier. The amplitude of the transmitted current gives the probability for a particle to 
tunnel through the barrier - even if this is not allowed classically. 
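Solving the four matching conditions by hand is a bit tedious, so here is a sketch that lets the computer do it. Everything specific in it is an assumption made for illustration: the units ($\hbar = m = 1$), the function names, and the values of E, V and a. It solves the linear system (9.44)-(9.47) with $A_1 = 1$, and compares $|A_3|^2$ with the closed form (9.48) and with the thick-barrier approximation (9.51).

```python
import numpy as np

def barrier_transmission(E, V, a, m=1.0, hbar=1.0):
    """Solve (9.44)-(9.47) with A1 = 1 and return (R, T) = (|B1|^2, |A3|^2)."""
    k = np.sqrt(2 * m * E) / hbar
    mu = np.sqrt(2 * m * (V - E)) / hbar
    ep, em = np.exp(mu * a), np.exp(-mu * a)
    pp, pm = np.exp(1j * k * a), np.exp(-1j * k * a)
    # Unknown vector: (B1, A2, B2, A3); the A1 = 1 terms go to the right-hand side.
    M = np.array([
        [pp,           -ep,       -em,       0.0],
        [-1j * k * pp, mu * ep,   -mu * em,  0.0],
        [0.0,          em,        ep,        -pp],
        [0.0,          -mu * em,  mu * ep,   -1j * k * pp],
    ], dtype=complex)
    rhs = np.array([-pm, -1j * k * pm, 0.0, 0.0], dtype=complex)
    B1, A2, B2, A3 = np.linalg.solve(M, rhs)
    return abs(B1) ** 2, abs(A3) ** 2

def closed_form(E, V, a, m=1.0, hbar=1.0):
    """Transmission probability (9.48)."""
    k = np.sqrt(2 * m * E) / hbar
    mu = np.sqrt(2 * m * (V - E)) / hbar
    num = (2 * k / mu) ** 2
    den = ((1 - k**2 / mu**2) ** 2 * np.sinh(2 * mu * a) ** 2
           + num * np.cosh(2 * mu * a) ** 2)
    return num / den

def thick_barrier(E, V, a, m=1.0, hbar=1.0):
    """Approximation (9.51), meant for mu * a >> 1."""
    mu = np.sqrt(2 * m * (V - E)) / hbar
    return 16 * (E / V) * (1 - E / V) * np.exp(-4 * mu * a)

# Illustrative values (assumed): energy well below the top of the barrier.
E, V, a = 0.5, 2.0, 1.0
R, T = barrier_transmission(E, V, a)
print("T numeric:", T, " closed form:", closed_form(E, V, a))
print("thick-barrier approximation:", thick_barrier(E, V, a))
print("R + T:", R + T)
```

For these values $\mu a$ is only moderately large, yet the approximation is already close to the exact result. Try a larger `a` to see the exponential suppression kick in.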
9.3.3 Application: Scanning Tunnelling Microscope

A nice application of the tunnel effect is the Scanning Tunnelling Microscope. This is an 
extremely high-resolution microscope, which relies on the tunnelling effect. The device 
consists of a sharp 'scanning tip', which is put at an elevated electrical potential with 
respect to the sample one wishes to study. By bringing the tip close to the sample, 
electrons will tunnel through the empty space between the tip and the sample - even 
when the two are not touching. This way, a current arises. The size of the current tells 
something about the distance between the tip and the sample. This way, one can 'feel' 
the shape of the sample, at a distance. (This distance is very small: the tip has to be
extremely close to the sample to realize a nonzero tunnelling current.) A modern STM has
such a good resolution, that it can easily distinguish single atoms on the sample surface. 
Pretty impressive! 
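To get a feeling for why the tunnelling current is such a sensitive probe of distance, here is a rough estimate. It is a sketch under loudly stated assumptions: we model the vacuum gap as a rectangular barrier of width $d = 2a$, we take the barrier to lie 4 eV above the electron energy (a typical order of magnitude for metals, chosen by us for illustration), and we only keep the exponential factor $e^{-4\mu a} = e^{-2\mu d}$ from (9.51). A real STM is of course more complicated.

```python
import numpy as np

HBAR = 1.054571817e-34      # J s
M_E = 9.1093837015e-31      # kg
EV = 1.602176634e-19        # J

def decay_constant(barrier_eV):
    """mu = sqrt(2 m (V - E)) / hbar for an electron, in 1/m."""
    return np.sqrt(2 * M_E * barrier_eV * EV) / HBAR

def relative_current(gap_m, barrier_eV=4.0):
    """Exponential tunnelling factor exp(-2 mu d) for a gap of width d."""
    return np.exp(-2 * decay_constant(barrier_eV) * gap_m)

# Compare a gap of 0.5 nm with a gap of 0.6 nm (assumed, illustrative numbers).
ratio = relative_current(5e-10) / relative_current(6e-10)
print("current drops by a factor", ratio, "when the gap grows by 0.1 nm")
```

With these assumptions, moving the tip away by the size of a single atom changes the current by close to an order of magnitude. This strong dependence is what makes atomic resolution possible.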
Figure 9.5: A scanning tunnelling microscope. The amplitude of the tunnelling current
allows one to scan the surface in an extremely sensitive way. This makes it possible to see objects as
small as individual atoms.



9.3.4 Application: It's warm outside! 

You probably know that our beloved sun is a very large fusion reactor. At its extremely 
high temperature, hydrogen nuclei fuse to helium nuclei. These are more strongly bound, 
so there is an energy gain in the process. This energy is radiated away and eventually
reaches us on earth. If you use what you have learned in this chapter, you may see a
problem. The very barrier which prevents particles from escaping the nucleus is also
there when two nuclei try to merge. So classically, it is very hard (almost impossible)
for two hydrogen nuclei to fuse into a helium nucleus. Ah - this means that once again
tunnelling plays a crucial role. Indeed, the tunnelling process gives rise to a very small but
nonzero fusion probability, just enough to make the sun into the bright shiny object we
see in the sky. So the next time you and your friends are enjoying a sunny day, make sure
to mention that none of this would be possible without quantum tunnelling. Caution: for
the well-being of your social life you might better not say things like that out loud. Well,
there still is the sweet inner joy of insight, right?
Exercises

1. An electron with energy 6 eV hits a 10"^" m long barrier of 8 eV high. What is the 
chance for the electron to go through it? 

2. The same question, but now the barrier is 0.01 m long. Do you have to be afraid that 
your cup of tea spontaneously empties itself by tunnelling of particles (or molecules) 
through the sides? 

3. A proton with energy 2 keV suddenly travels into a region where the electric potential 
is 10 V higher. What would happen if a proton was a charged point particle? Using 
quantum mechanics, what is the chance for the proton to be reflected? 

4. Prove that the charge corresponding to a conserved current is indeed conserved in 
time. Can you do the three dimensional case? What expression would you take for 
Q? Can you show it's conserved? 

5. Study a situation where a particle (coming from the left) drops off a potential step.
So $V(x) = V$ on the left, and $V(x) = 0$ on the right. Solve the SE, write boundary
conditions and find R and T using probability conditions. Compare the result to 
what you expect classically. 

6. Challenge: try to solve the potential well: a particle stuck in a potential which has 
$V(x) = -V$ for $-L < x < L$ and zero elsewhere. Can you recover the particle in a
box by taking the appropriate limit? 



Chapter 10 
Spin 



In this chapter... 

By now, you have become quite experienced in dealing with particles in potentials. As 
a consequence, you also know how to deal with electric fields. Just write down the corresponding
potential (such that $-\nabla V = \vec{F} = q\vec{E}$), solve the Schrödinger equation of a
particle moving in this $V$, et voila. In fact, this is precisely what we did in the chapter on
the hydrogen atom. For a random electric field and the corresponding potential, solving 
the SE may be a hard problem, but you can always do this in principle. You or your 
computer just need to be good enough at solving differential equations. 

That is very nice, but it leaves an important question unanswered. What about mag- 
netic fields? You know that there is no such thing as 'the magnetic potential' so the 
influence of magnetic fields can not be summarized by giving a potential V. So what 
about those situations, in which a particle is immersed in a magnetic field? It is clear that 
there will be some new input here. Indeed, we will meet a new and important concept: 
spin. This is a property of particles which not only determines their behavior in magnetic 
fields, but also in other situations. Reason enough to have a closer look! 
10.1 Tools



10.1.1 Magnetic moment 

As you very well know, a free electric charge will start to move when immersed in an electric 
field. If the electric field is created by two distant source charges of opposite sign, the free 
charged particle will move towards the source charge with opposite sign. Something nice 
happens when you attach two small opposite charges to each other to form an electric 
dipole still immersed in the same electric field. They will each pull towards one side, 
and stay put by force balance. Hence, the only thing that happens is the alignment of 
the electric dipole with the electric field. Now let us compare this with magnetism. In 
that case there are no monopoles, that is: there are no isolated magnetic charges. But of 
course there are magnetic dipoles - which can be thought of as a little magnet. Just like 
the electric dipoles, they tend to align with an external field, which is magnetic in this 
case. So the energy for a magnetic dipole to be aligned with an external field is lower than 
when it is anti-aligned. If we denote the magnetic moment (= strength of the dipole) by
$\vec{M}$, and the external field by $\vec{B}$, the energy $W$ of the dipole is

$$W = -\vec{M} \cdot \vec{B} \tag{10.1}$$

This is indeed lowest when $\vec{M} \parallel \vec{B}$, so that alignment is preferred. The situation is shown
in the figure below:
Figure 10.1: An electric dipole aligns with the external E-field. Similarly, a magnetic
dipole aligns with the external B-field. Notice that the electric dipole is conventionally 
defined as a vector pointing from the negative to the positive particle - not to be confused 
with the definition of the electric field, which points from positive to negative. 
10.1.2 Larmor precession

We just refreshed the well-known fact that a magnetic dipole tends to align with an
external magnetic field. This of course can only happen if the dipole can dispose of the
energy released by aligning. Otherwise, it will typically perform an oscillating movement,
not able to find its equilibrium value because of conservation of energy. Similarly, a ball
can only roll to the bottom of a pit and stay there because of the energy lost while rolling
down. If there was no friction, the ball would be doomed to roll in and out constantly. In
the following, we will show that a magnetic field indeed can keep a dipole in an oscillatory
motion, or more precisely: a rotation. This phenomenon is called Larmor precession.

To be concrete, let us consider a small electric loop current - like a charge running in
a circle. As you know from electromagnetism, the circular movement of a charge creates a
magnetic field: a dipole. If $\vec{S}$ is the angular momentum of the circling charge, the induced
magnetic moment $\vec{M}$ is proportional to it:

$$\vec{M} = \gamma \vec{S} \tag{10.2}$$

The proportionality constant $\gamma$ depends on the details of the loop current (which we do not
consider here) and is called the gyromagnetic ratio. As we just refreshed, a magnetic
dipole feels a force trying to align it with the external B-field. This can be expressed as a
torque, trying to turn the dipole in the direction of $\vec{B}$:

$$\vec{\tau} = \vec{M} \times \vec{B} \tag{10.3}$$

The direction of the torque is shown in Figure 10.2. Just like a force causes the momentum
to change, the torque causes the angular momentum to change:

$$\vec{\tau} = \frac{d}{dt}(\vec{r} \times \vec{p}) = \frac{d}{dt}\vec{S} \tag{10.4}$$

where $\vec{S}$ as before denotes the angular momentum $\vec{r} \times \vec{p}$ of the orbiting charge. Combining
the last three expressions, we get

$$\frac{d}{dt}\vec{S} = \gamma\, \vec{S} \times \vec{B}. \tag{10.5}$$

Taking the magnetic field directed along the z-axis, we get $\vec{B} = (0, 0, B)$ with $B$ the
magnitude of the field. Hence, the components of the above equation are:

$$\frac{d}{dt} S_x = \gamma (S_y B_z - S_z B_y) = \gamma S_y B \tag{10.6}$$

$$\frac{d}{dt} S_y = \gamma (S_z B_x - S_x B_z) = -\gamma S_x B \tag{10.7}$$

$$\frac{d}{dt} S_z = \gamma (S_x B_y - S_y B_x) = 0 \tag{10.8}$$
Figure 10.2: Left: direction of the torque, exerted on a magnetic dipole in an external field.
Both $\vec{B}$ and the magnetic moment are in the plane of the paper, while $\vec{\tau}$ is pointing directly towards you. Right:
the circular movement arising from the torque: Larmor precession. This movement is quite 
similar to that of a planet in its orbit. In that case the linear momentum is constantly 
changing due to the force acting. For Larmor precession the angular momentum of the 
dipole is constantly changing due to the torque acting. 



The last equation implies $S_z$ is constant in time. The first two combine to
$\frac{d^2}{dt^2} S_x = -\gamma^2 B^2 S_x$, which has solutions of the form

$$S_x = A \cos(\omega t - \phi) \tag{10.9}$$

with $\omega = |\gamma| B$. This number is known as the Larmor frequency. The total dynamics is
thus given by

$$\vec{S} = \left(A \cos(\omega t - \phi),\; \pm A \sin(\omega t - \phi),\; S_z\right) \tag{10.10}$$

with the sign of $S_y$ determined by the sign of $\gamma$. The total motion is shown in Figure 10.2.
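You can also watch the precession happen by integrating equation (10.5) numerically. The sketch below is an illustration with assumed ingredients: a standard fourth-order Runge-Kutta step written out by hand, dimensionless units, and made-up values for $\gamma$, $\vec{B}$ and the initial $\vec{S}$. It compares the numerical result with the analytic solution (10.10).

```python
import numpy as np

def larmor_rhs(S, gamma, B):
    """Right-hand side of dS/dt = gamma * S x B."""
    return gamma * np.cross(S, B)

def integrate(S0, gamma, B, dt, n_steps):
    """Advance S over n_steps time steps with the classical Runge-Kutta scheme."""
    S = np.array(S0, dtype=float)
    for _ in range(n_steps):
        k1 = larmor_rhs(S, gamma, B)
        k2 = larmor_rhs(S + 0.5 * dt * k1, gamma, B)
        k3 = larmor_rhs(S + 0.5 * dt * k2, gamma, B)
        k4 = larmor_rhs(S + dt * k3, gamma, B)
        S = S + dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return S

# Illustrative values (assumed): field along z, S starting along x with some S_z.
gamma, B_vec = 1.0, np.array([0.0, 0.0, 2.0])
dt, n_steps = 1e-3, 1000
S_end = integrate([1.0, 0.0, 0.5], gamma, B_vec, dt, n_steps)

# Analytic solution with A = 1, phi = 0; for gamma > 0 the sign of S_y is minus.
t_end = dt * n_steps
omega = abs(gamma) * np.linalg.norm(B_vec)
S_exact = np.array([np.cos(omega * t_end),
                    -np.sign(gamma) * np.sin(omega * t_end),
                    0.5])
print("numerical:", S_end)
print("analytic: ", S_exact)
```

Note that $S_z$ and the length of $\vec{S}$ do not change: the vector just circles around the direction of the field at the Larmor frequency.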



10.2 Spin



10.2.1 Zeeman effect 

In the previous example we considered a magnetic dipole arising from an orbiting charge.
Where could we naturally meet such a system? Precisely, the electron in an atom! Indeed,
it is charged and every state with $\ell \geq 1$ has nonzero angular momentum:

$$L^2 |n, \ell, m\rangle = \hbar^2 \ell(\ell + 1) |n, \ell, m\rangle \tag{10.11}$$

From this you might infer all states with $\ell \geq 1$ have a magnetic dipole moment. For
a classical magnetic dipole that is caused by an angular momentum, the motion is just
the Larmor precession described above. But what happens according to the full-fledged
quantum mechanical description? First, write

$$\vec{M} = \gamma \vec{L} \tag{10.12}$$
Figure 10.3: In the presence of a B-field, the degenerate energy levels of an electron
are split into several distinct levels. This confirms that the electron's angular momentum 
creates a magnetic moment that couples to the external B-field. However, this is not 
enough to explain the observed splitting. It seems that there is an extra contribution. This 
is due to the intrinsic angular momentum of the electron (the spin) and the corresponding 
intrinsic magnetic moment. 



where $\gamma$ is the gyromagnetic ratio relating the angular momentum and the magnetic
moment of the 'orbiting' electron. This makes $\vec{M}$ the quantum mechanical operator
corresponding to the magnetic moment. Moreover, $\vec{M}^\dagger = \gamma^* \vec{L}^\dagger = \gamma \vec{L} = \vec{M}$ so that $\vec{M}$
is Hermitian - meaning it is an observable. As we noted before, the classical energy of a
dipole in a B-field is given by $W = -\vec{M} \cdot \vec{B}$. This suggests that for the quantum description,
we have to add to our Hamiltonian a piece:

$$H_L = -\vec{M} \cdot \vec{B} = -\gamma\, \vec{L} \cdot \vec{B} \tag{10.13}$$

so that the total Hamiltonian for an atomic electron in a B-field $(0, 0, B)$ becomes

$$H = H_0 + H_L = \left(-\frac{\hbar^2}{2m} \Delta + V\right) - \gamma B L_z. \tag{10.14}$$

The electron states $|n, \ell, m\rangle$ will also be eigenstates of this new Hamiltonian, but with a
different energy. Indeed:

$$H |n, \ell, m\rangle = (H_0 + H_L) |n, \ell, m\rangle = (E_n - \gamma B \hbar m) |n, \ell, m\rangle \tag{10.15}$$



This is a very interesting result. Without the B-field, all states with the same main
quantum number $n$ have the same energy $E_n$. However, in the presence of an external
magnetic field the total energy $E = E_n - \gamma B \hbar m$ also depends on $m$. Hence, the energy
degeneracy is partly lifted: it now depends on $n$ and also on $m$. Originally, it was Zeeman
who discovered this shift of the energy levels of atoms in the presence of a B-field. The
Zeeman effect for hydrogen partly agrees with the above reasoning: indeed, spectral lines
split up due to the magnetic field. That's nice, but there is one big problem: the splitting
is not just $-\gamma B \hbar m$. It seems as if on top of the angular momentum of the electron(s),
there is an extra contribution to the magnetic moment, not due to the angular momentum
of the wave function. After a long history of guesses and other experiments, physicists
found a resolution to this paradox: the electron must have an intrinsic angular momentum
and a corresponding intrinsic magnetic moment. This property is called the spin of
the electron. A way to visualize this spin is shown in Figure 10.4. Imagine an electron
would be a small charged sphere, rapidly rotating around one axis. Then it would have
an angular momentum, and a magnetic moment due to the rotating charge of the sphere.
This picture is not really correct (an electron obviously is not just a charged sphere) but
it gives a nice classical idea on how such an intrinsic angular momentum may arise.
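How big is the orbital splitting $-\gamma B \hbar m$ in practice? The sketch below gives a rough number. It rests on assumptions that are not made in the text above: we take for $\gamma$ the classical value $-e/(2 m_e)$ of a point charge $-e$ on an orbit, we use the rounded hydrogen levels $E_n = -13.6\,\text{eV}/n^2$, and the helper `zeeman_levels` is a hypothetical name. As explained above, the true splitting also contains the spin contribution, which this estimate ignores.

```python
HBAR = 1.054571817e-34      # J s
M_E = 9.1093837015e-31      # kg
E_CHARGE = 1.602176634e-19  # C (numerically also the number of joules per eV)

# Assumption: gyromagnetic ratio of a point charge -e on a classical orbit.
GAMMA_ORBIT = -E_CHARGE / (2 * M_E)

def zeeman_levels(n, ell, B):
    """Energies E_n - gamma B hbar m in eV, for m = -ell, ..., ell."""
    E_n = -13.6 / n**2   # hydrogen levels in eV (rounded)
    return {m: E_n - GAMMA_ORBIT * B * HBAR * m / E_CHARGE
            for m in range(-ell, ell + 1)}

# Illustrative case (assumed): the n = 2, ell = 1 states in a field of 1 tesla.
levels = zeeman_levels(n=2, ell=1, B=1.0)
spacing = levels[1] - levels[0]
for m, energy in levels.items():
    print("m =", m, " E =", energy, "eV")
print("spacing between neighbouring m:", spacing, "eV")
```

The spacing comes out tiny compared to the level energies themselves (tens of micro-eV against a few eV), which is why the splitting only shows up as a fine structure in the spectral lines.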





Figure 10.4: A current in a loop creates a magnetic moment. Similarly, a rotating charged 
sphere would do so. This gives a pictorial idea on how the intrinsic magnetic moment of 
an electron might arise. Be careful: this picture is too simplistic. We will give a correct 
and more detailed description below. 



10.2.2 Stern-Gerlach experiment 

Another important experiment in the discovery of the electron spin, was that of Stern and 
Gerlach. They prepared vaporized silver atoms in a furnace, and fired these into a region 
with an inhomogeneous magnetic field: 

$$\vec{B} = (0, 0, B(z)) \quad \text{with} \quad \partial_z B(z) > 0 \tag{10.16}$$

The setup is shown in Figure 10.5. If the atom has magnetic moment $\vec{M}$, the force exerted
on it by passing through the field is given by

$$\vec{F} = -\nabla W = \nabla(\vec{M} \cdot \vec{B}) \tag{10.17}$$

The last expression only has a nonzero z-component, so the force acting on the atom is
given by

$$F_z = M_z\, \partial_z B(z). \tag{10.18}$$

Several remarks are in order. First, the force is due only to the gradient of the
magnetic field. Also note that it is proportional to the z-component of the magnetic moment
only. From the part on Larmor precession, we know that the effect of a B-field on the
magnetic moment is rather simple: precession occurs in the x- and y-directions, but $M_z$
stays constant. In conclusion: the atom is deflected by the field, in the direction of the
force $F_z \propto M_z$. If $M_z > 0$, the atoms will be sent upwards; for $M_z < 0$ they bend off to
lower z. Since the magnetic moment of the atoms can be oriented in any direction, we
expect $M_z$ to take on a continuous range of values. This would imply that the particles hit
the wall on the right of the detector along a continuous region. This is shown with the
dashed line ('classical expectation') on the right in Figure 10.5.

Figure 10.5: The Stern-Gerlach experiment. The B-field is directed along the z-axis, and
also increases in strength in that direction: $\partial_z B(z) > 0$. On the right, the result of the
experiment is shown: detection occurs at two central bright spots. This contrasts with
the classical expectation, which predicts detection along a wide and smeared out region.



The outcome 

As shown in the figure, the outcome of the experiment is rather different. Instead of
particles impacting along a broad region of the screen, they only hit two well-localized
spots. The locations of the spots correspond to the following magnetic moments:

$$M_z = \pm \gamma_e \frac{\hbar}{2} \tag{10.19}$$

where $\gamma_e$ is a numerical constant. This is a pretty shocking result if you are a die-hard
fan of classical physics. Just by sending a beam of ordinary silver atoms through some
magnetic field, it splits into two very sharp lines! Obviously, it's hard to explain this with
classical physics.

Can we understand this result with the spin picture? The answer is yes. First, one can
show that the magnetic moment of a silver atom is due to only one of its electrons. All
other electrons pair up in states with opposite spin, and the total orbital angular momentum
of all the electrons in silver is zero. This will be explained better in the next chapter; just
accept for now that the experiment really measures the intrinsic magnetic moment of one
electron only. If we define the constant $\gamma_e$ to be the electron's (intrinsic) gyromagnetic
ratio, equation (10.19) readily implies that

$$S_z = \pm \frac{\hbar}{2} \tag{10.20}$$

Hence, from the Stern-Gerlach experiment, we draw two conclusions:

• The size of the intrinsic angular momentum of the electron is equal to $\hbar/2$.

• The direction of the spin is very restrained: it can only point upwards or downwards,
nothing in between.

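Let us put this in a quantum mechanical style. The electron (ignoring the spatial dependence
of the wave function) can be in two states: spin up and spin down, which we
denote as follows

$$\text{spin up} \sim \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \qquad \text{spin down} \sim \begin{pmatrix} 0 \\ 1 \end{pmatrix} \tag{10.21}$$

We also define the operator corresponding to the z-component of the intrinsic angular
momentum (the spin operator) to be

$$S_z = \frac{\hbar}{2} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{10.22}$$

This operator acts on the spin up and spin down states by ordinary matrix multiplication.
Denoting the spin up and spin down state by $|\uparrow\rangle$ and $|\downarrow\rangle$, we have

$$S_z|\uparrow\rangle = \frac{\hbar}{2} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{\hbar}{2} \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{\hbar}{2}|\uparrow\rangle \tag{10.23}$$

and similarly

$$S_z|\downarrow\rangle = \frac{\hbar}{2} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} = -\frac{\hbar}{2} \begin{pmatrix} 0 \\ 1 \end{pmatrix} = -\frac{\hbar}{2}|\downarrow\rangle \tag{10.24}$$

From this we conclude that the spin up state $|\uparrow\rangle$ has eigenvalue $\hbar/2$ under the operator
$S_z$, and the spin down state $|\downarrow\rangle$ has eigenvalue $-\hbar/2$. This corresponds precisely to the
two spots on the detection screen. When the electron is in the up state, it is deflected
upwards by the Stern-Gerlach apparatus. When it is in the down state, it will hit the spot
lower on the screen.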

Figure 10.6: Again relying on the naive depiction of an electron as a rotating sphere, 
we can see the spin up and spin down states as two different rotations. Rotating in one 
direction gives spin up, rotating in the opposite direction gives spin down. Again, we 
stress this is a simplistic visualization of the spin. 
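
The statement that the operator for the z-component of the spin acts on the up and down states by ordinary matrix multiplication can be checked in a few lines. The following is only a minimal sketch: the helper names `apply` and `scale` are made up for this illustration, and units with hbar = 1 are assumed for readability.

```python
# Sketch only: units with hbar = 1 are an assumption made for readability.
hbar = 1.0

up = [1, 0]      # spin up as a column vector
down = [0, 1]    # spin down as a column vector

# S_z = (hbar/2) * [[1, 0], [0, -1]]
S_z = [[hbar / 2, 0], [0, -hbar / 2]]


def apply(matrix, vector):
    """Multiply a 2x2 matrix with a 2-component column vector."""
    return [sum(matrix[row][k] * vector[k] for k in range(2)) for row in range(2)]


def scale(number, vector):
    """Multiply every component of a vector by a number."""
    return [number * component for component in vector]


# Eigenvalue equations: S_z|up> = +hbar/2 |up> and S_z|down> = -hbar/2 |down>
print(apply(S_z, up) == scale(hbar / 2, up))
print(apply(S_z, down) == scale(-hbar / 2, down))
```

Both comparisons should come out true, which is just the eigenvalue statement for the two spots on the screen written out component by component.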



Superpositions 

According to the rules of quantum mechanics, all linear combinations of these two should
be physical states too. So any state of the form

$$\alpha \begin{pmatrix} 1 \\ 0 \end{pmatrix} + \beta \begin{pmatrix} 0 \\ 1 \end{pmatrix} = \begin{pmatrix} \alpha \\ \beta \end{pmatrix} \tag{10.25}$$

is a good state too, for any complex numbers $\alpha$ and $\beta$. (If you require normalization, you
need $|\alpha|^2 + |\beta|^2 = 1$.) What happens to such a state when it enters the apparatus? As we
have mentioned several times by now, the Schrödinger equation (for whatever system or
Hamiltonian) is linear. Hence, if a state is the sum of two parts, each part will evolve in
its own way, as if the other was not there. From this we infer that the $\alpha \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ part will
travel upwards, and the $\beta \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ part will fly downwards. Hence the wave function will split
in two parts. The total weights (squared norms) of the two parts of the wave function are given by $|\alpha|^2$
and $|\beta|^2$. You know that the measuring device (detecting screen) will then report impact
on the upper spot with probability $|\alpha|^2$, and impact on the lower spot with probability
$|\beta|^2$. So actually, we should say that an electron can be:

• in the up state

• in the down state

• in a superposition

However, for any of these possibilities there will only be one place at which detection
occurs, so it always looks as if the electron was strictly up or down, even if the state
entering the apparatus was a superposition. So just by adding the notion of spin to our
quantum theory, we can successfully explain the stunning experiment of Stern and Gerlach.
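
To make the detection probabilities for a superposition concrete, here is a small simulation sketch. Everything in it is illustrative: the function names, the example amplitudes 0.6 and 0.8i, and the fixed random seed are assumptions, not part of the course text. Each simulated electron lands on exactly one spot, and only the counts over many electrons reveal the two squared amplitudes.

```python
import random


def detection_probabilities(alpha, beta):
    """Probabilities for the upper and lower spot, normalising if needed."""
    norm = abs(alpha) ** 2 + abs(beta) ** 2
    return abs(alpha) ** 2 / norm, abs(beta) ** 2 / norm


def simulate_screen(alpha, beta, n_electrons, seed=0):
    """Send n_electrons through the apparatus; each one hits a single spot."""
    rng = random.Random(seed)
    p_up = detection_probabilities(alpha, beta)[0]
    hits_up = sum(1 for _ in range(n_electrons) if rng.random() < p_up)
    return hits_up, n_electrons - hits_up


# Hypothetical example state: alpha = 0.6, beta = 0.8i (already normalised).
alpha, beta = 0.6, 0.8j
p_up, p_down = detection_probabilities(alpha, beta)
hits_up, hits_down = simulate_screen(alpha, beta, 10_000)

print("predicted:", p_up, p_down)
print("simulated:", hits_up / 10_000, hits_down / 10_000)
```

The simulated fractions fluctuate around the predicted ones; the phase of beta plays no role for this particular measurement.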
Playing with spins

This clarifies the z-component of the spin, but what about the other components, x and y?
Well, Stern-Gerlach apparatuses are like Lego pieces: you can put them together in many
ways, and have lots of fun. Consider drilling a hole in the detecting screen, at the spin
up spot, and placing another Stern-Gerlach apparatus behind it, as in Figure 10.7. The
outcome is not so surprising: the screen after the second apparatus will only detect spin
ups. Indeed: the hole served as a kind of filter, letting through only the $|\uparrow\rangle$ states.
Hence, at a second splitting, there will only be particles going upwards, as confirmed by
experiment. Now do something special. Keep the first one, but twist the second apparatus
over an angle of 90°, so that the B-field is in the x-direction. That way it selects states
by $S_x$, the x-component of their spin (and hence of their intrinsic magnetic moment). The outcome is surprising:
again two spots are seen on the final screen: one spot for $S_x = \hbar/2$ and one
spot for $S_x = -\hbar/2$. How can this be? Somehow, the state $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ of definite $S_z$ does not
have a definite $S_x$. It is a superposition of a spin up and a spin down when viewed in the
x-direction. A similar result holds when studying $S_y$. Well, let us now look at it from the
quantum side. Both $S_x$ and $S_y$ are, like $S_z$, physical quantities: they can be measured.
They have to correspond to operators acting on the spin state. So, just like $S_z$, they have
to be 2-by-2 matrices acting on the spin state $\begin{pmatrix} \alpha \\ \beta \end{pmatrix}$. So we need

$$S_x = \begin{pmatrix} a_x & b_x \\ c_x & d_x \end{pmatrix}, \qquad S_y = \begin{pmatrix} a_y & b_y \\ c_y & d_y \end{pmatrix} \tag{10.26}$$

for complex numbers $a_x, \ldots, d_y$. How can we guess the form of $S_x$ and $S_y$? Because the
operators $S_x$ and $S_y$ are observables, they have to be Hermitian. Hence the corresponding
matrices need to be so too. So we need

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} = \begin{pmatrix} a^* & c^* \\ b^* & d^* \end{pmatrix}. \tag{10.27}$$

This is a very restrictive property. You can easily check that (up to linear combinations with real coefficients)
there are only 4 Hermitian 2-by-2 matrices:

$$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \qquad \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{10.28}$$

The first one is trivial: the identity matrix. This is not really an interesting operator. The
last one you should also recognize: it's our friend $S_z$, modulo the factor $\hbar/2$ which has
been stripped off. A fond guesser might then conjecture that

$$S_x = \frac{\hbar}{2} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad S_y = \frac{\hbar}{2} \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} \tag{10.29}$$

Figure 10.7: Playing with different SG apparatuses. Top: drilling a hole at the spin-up
spot and placing another apparatus after that. The outcome is as expected: only spin-ups
are detected in the second device. Bottom: when placing a second, but rotated device,
things become more interesting. Again, two spots emerge, one at $S_x = \hbar/2$ and one at
$S_x = -\hbar/2$. This means that a state with spin up along the z-axis does not have a definite
value for $S_x$.



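Well, this actually is a good guess. To convince you a bit, we will use the above guess to
explain the outcome in Figure 10.7. In the first place,

$$S_x \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{\hbar}{2} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \frac{\hbar}{2} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \tag{10.30}$$

so clearly $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ is not a state with definite $S_x$. However, it can be broken in two parts:

$$\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 1/2 \\ 1/2 \end{pmatrix} + \begin{pmatrix} 1/2 \\ -1/2 \end{pmatrix} \tag{10.31}$$

And these parts are eigenstates of $S_x$:

$$S_x \begin{pmatrix} 1/2 \\ 1/2 \end{pmatrix} = \frac{\hbar}{2} \begin{pmatrix} 1/2 \\ 1/2 \end{pmatrix}, \qquad S_x \begin{pmatrix} 1/2 \\ -1/2 \end{pmatrix} = -\frac{\hbar}{2} \begin{pmatrix} 1/2 \\ -1/2 \end{pmatrix} \tag{10.32}$$

This explains the outcome of the 'twisted' Stern-Gerlach experiment. The first part selects
the state to be in $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$. This is an eigenstate of $S_z$, but not of $S_x$. Hence, when passing
through the second B-field, the state gets split again. The part $\begin{pmatrix} 1/2 \\ 1/2 \end{pmatrix}$ has positive $S_x$,
hence feels a force $F_x > 0$ and goes to positive x. The part $\begin{pmatrix} 1/2 \\ -1/2 \end{pmatrix}$ has negative $S_x$,
so is subject to $F_x < 0$ and goes the other way. Similar reasonings hold when considering
$S_z$ and $S_y$ or $S_x$ and $S_y$: if an electron is in a definite state with respect to one of the
three operators $S_x$, $S_y$ or $S_z$, it will be a superposition of the up and down states along
any other direction.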
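
The argument for the 'twisted' experiment can be replayed numerically. The sketch below assumes the guessed form of the x-component operator, hbar/2 times the matrix with rows (0, 1) and (1, 0), and units with hbar = 1; the helper names are hypothetical. It checks that spin up along z is not an eigenstate of this operator, that it splits into two parts that are, and that both parts carry the same weight.

```python
hbar = 1.0  # assumption: units with hbar = 1

# Guessed operator for the x-component of the spin
S_x = [[0, hbar / 2], [hbar / 2, 0]]


def apply(matrix, vector):
    """Multiply a 2x2 matrix with a 2-component column vector."""
    return [sum(matrix[row][k] * vector[k] for k in range(2)) for row in range(2)]


def squared_norm(vector):
    """Sum of the squared absolute values of the components."""
    return sum(abs(component) ** 2 for component in vector)


up_z = [1, 0]
part_plus = [0.5, 0.5]     # eigenvalue +hbar/2 under S_x
part_minus = [0.5, -0.5]   # eigenvalue -hbar/2 under S_x

# The two parts add up to the spin-up state along z.
recombined = [p + m for p, m in zip(part_plus, part_minus)]
print(recombined)

# S_x maps up_z onto the down state, so up_z has no definite S_x.
print(apply(S_x, up_z))

# Each part is only rescaled by S_x, by +hbar/2 and -hbar/2 respectively.
print(apply(S_x, part_plus), apply(S_x, part_minus))

# Equal weights: both spots on the final screen are equally bright.
print(squared_norm(part_plus), squared_norm(part_minus))
```

The equal weights of the two parts are what one would read off as equal intensities of the two spots after the rotated apparatus.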

10.2.3 The Zeeman effect, ctd. 

We can now see why Zeeman's experiment gave a 'strange' outcome. Consider a free
electron (so not bound in an atom). Put it in a magnetic field $\mathbf{B} = (0, 0, B)$. The Hamiltonian
of the electron is:

$$H = -\mathbf{B} \cdot \mathbf{M} = -\gamma_e \mathbf{B} \cdot \mathbf{S} = -\gamma_e B S_z \tag{10.33}$$

Hence, the spin-up and the spin-down states are energy eigenstates:

$$H|\uparrow\rangle = -\gamma_e B \frac{\hbar}{2} |\uparrow\rangle, \qquad H|\downarrow\rangle = \gamma_e B \frac{\hbar}{2} |\downarrow\rangle \tag{10.34}$$

Their energy difference is given by:

$$\Delta E = \gamma_e B \frac{\hbar}{2} - \left(-\gamma_e B \frac{\hbar}{2}\right) = \gamma_e B \hbar \tag{10.35}$$

So in conclusion, when you want to know the energy of an electron in an atom, not only
do you need to consider the energy shift due to its orbital angular momentum (proportional to
the quantum number m), you also need to specify its spin state (up or down) to get the
right expression. If you carefully account for both of those, your prediction agrees very closely
with Zeeman's measurements.
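
To get a feeling for the size of the spin splitting, the energies of the two spin states can be evaluated numerically. This is a rough sketch under stated assumptions: the field strength of 1 tesla is an arbitrary example value, the gyromagnetic ratio is taken as the magnitude 1.76 · 10^11 s^-1 T^-1 quoted in the exercises of this chapter, and the function names are made up for the illustration.

```python
HBAR = 1.054571817e-34          # reduced Planck constant in J s
GAMMA_E = 1.76e11               # magnitude of the electron spin gyromagnetic ratio, s^-1 T^-1
ELECTRON_VOLT = 1.602176634e-19  # one electronvolt in joule


def spin_energies(b_field):
    """Energies of the up and down state for H = -gamma_e * B * S_z."""
    e_up = -GAMMA_E * b_field * HBAR / 2
    e_down = GAMMA_E * b_field * HBAR / 2
    return e_up, e_down


def splitting(b_field):
    """Energy difference between the two spin states, gamma_e * B * hbar."""
    e_up, e_down = spin_energies(b_field)
    return e_down - e_up


B = 1.0  # tesla; a hypothetical laboratory field, chosen only as an example
delta_e = splitting(B)
print("splitting in joule:", delta_e)
print("splitting in eV:   ", delta_e / ELECTRON_VOLT)
```

The splitting grows linearly with the field, so doubling B doubles the distance between the two levels.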

This concludes the story part. You now understand the outcome of both the Zeeman 
and Stern-Gerlach experiment thanks to the concept of spin. Of course, you may not be 
fully comfortable with it yet. To get more grip on it, we will work out some more details 
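and concrete examples in the last part of this chapter.

10.3 Working with spin

10.3.1 Bras and kets

A nice thing about spin is that it realizes the bra and ket formalism in a very simple way.
For a general spin ket $|s\rangle = \alpha|\uparrow\rangle + \beta|\downarrow\rangle$, we define its bra as follows:

$$|s\rangle = \begin{pmatrix} \alpha \\ \beta \end{pmatrix} \quad \Rightarrow \quad \langle s| = \begin{pmatrix} \alpha^* & \beta^* \end{pmatrix} \tag{10.36}$$

The inner product of states is nothing but the inner product of vectors. For
$|s_1\rangle = \begin{pmatrix} a \\ b \end{pmatrix}$ and $|s_2\rangle = \begin{pmatrix} c \\ d \end{pmatrix}$ we have

$$\langle s_1|s_2\rangle = \begin{pmatrix} a^* & b^* \end{pmatrix} \begin{pmatrix} c \\ d \end{pmatrix} = a^* c + b^* d \tag{10.37}$$

In particular, the square of the norm of a spin state $|s_1\rangle = \begin{pmatrix} \alpha \\ \beta \end{pmatrix}$ is given by

$$\| s_1 \|^2 = \langle s_1|s_1\rangle = \begin{pmatrix} \alpha^* & \beta^* \end{pmatrix} \begin{pmatrix} \alpha \\ \beta \end{pmatrix} = |\alpha|^2 + |\beta|^2 \tag{10.38}$$

This clarifies our remark in the previous section: a spin state $\begin{pmatrix} \alpha \\ \beta \end{pmatrix}$ is normalized if and
only if $|\alpha|^2 + |\beta|^2 = 1$. This is a nice moment to write down the spin up and down states
along the x and y axes. Denoting them by $|\pm\rangle_x$ and $|\pm\rangle_y$, we have

$$|\pm\rangle_x = \begin{pmatrix} 1/\sqrt{2} \\ \pm 1/\sqrt{2} \end{pmatrix} \quad \text{and} \quad |\pm\rangle_y = \begin{pmatrix} 1/\sqrt{2} \\ \pm i/\sqrt{2} \end{pmatrix} \tag{10.39}$$

You can verify their normalisation and that

$$S_x|\pm\rangle_x = \pm\frac{\hbar}{2}|\pm\rangle_x \quad \text{and} \quad S_y|\pm\rangle_y = \pm\frac{\hbar}{2}|\pm\rangle_y. \tag{10.40}$$

Also, it is nice to check explicitly that the inner product between the two eigenstates of
any $S_i$ is zero. This has to be so, since these pairs are eigenstates (eigenvectors) of a
Hermitian operator (matrix) with different eigenvalues.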
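
The normalisation and orthogonality checks suggested above are quick to carry out by machine. The sketch below is illustrative only (the function name `bra_ket` and the variable names are assumptions); it takes the up and down states along x as (1, ±1)/√2 and along y as (1, ±i)/√2, and it also shows what goes wrong when the complex conjugation in the bra is forgotten.

```python
import math


def bra_ket(s1, s2):
    """Inner product <s1|s2>: conjugate the components of s1, then sum."""
    return sum(complex(a).conjugate() * b for a, b in zip(s1, s2))


r = 1 / math.sqrt(2)
plus_x, minus_x = [r, r], [r, -r]
plus_y, minus_y = [r, 1j * r], [r, -1j * r]

# Normalisation: each state has unit norm.
print(abs(bra_ket(plus_x, plus_x)), abs(bra_ket(plus_y, plus_y)))

# Orthogonality of the two eigenstates along the same axis.
print(abs(bra_ket(plus_x, minus_x)), abs(bra_ket(plus_y, minus_y)))

# Forgetting the conjugation gives a wrong 'norm' for a complex state.
naive = sum(a * b for a, b in zip(plus_y, plus_y))
print(abs(naive))
```

Up to rounding, the first line shows ones, the second line zeros, and the naive product without conjugation wrongly vanishes for the state along y.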

10.3.2 Angular momentum 

If you are very (very) critical, you may remark that the Zeeman and Stern-Gerlach experiments
do not strictly prove the existence of an electron's angular momentum. They clearly
show an electron has an intrinsic magnetic dipole moment - how else could it get an extra
energy term $-\mathbf{M} \cdot \mathbf{B}$ in the presence of a magnetic field? From this observation then, we just
assumed the underlying reason for this dipole is an intrinsic angular momentum - which
we called spin. This is a very natural thing to do, but how do we show more directly
that this spin really is a form of angular momentum? Or more generally, if one has three
operators $(O_x, O_y, O_z)$, what makes one conclude that these give the angular momentum of
a quantum system? Classically, a quantity is an angular momentum if it gives an object's
'amount of rotation', but in quantum mechanics we cannot use this pictorial definition. It
might be good at this point to go back quite a bit, to the point where we computed the
commutation relations of the angular momentum operators $\mathbf{L} = (L_x, L_y, L_z)$. We found
that

$$[L_x, L_y] = i\hbar L_z \quad \text{and cyclic} \tag{10.41}$$

By 'cyclic' we mean that you can shift all three variables $x \to y \to z \to x$ to get the other
two commutators. These commutation relations are actually quite special. They are so
special, that any set of operators satisfying similar commutation relations is defined
to be a generalised angular momentum. This gives a nice and strict definition of
angular momentum in quantum mechanics. Do the spin operators $\mathbf{S} = (S_x, S_y, S_z)$ satisfy
the above commutation relations? You bet: with the matrix representation it is easy to
show that

$$[S_x, S_y] = i\hbar S_z \quad \text{and cyclic} \tag{10.42}$$

Hence, we can comfortably state that spin really is a (generalized) angular momentum,
just because of the structure of the commutators of its three components.
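
The claim that the spin matrices obey the angular momentum commutation relations is easy to verify by brute force. The following sketch assumes units with hbar = 1 and the matrix forms hbar/2 times (0 1; 1 0), (0 -i; i 0) and (1 0; 0 -1) for the x, y and z components; the helper functions are hypothetical names. As a by-product it also evaluates the sum of the three squared matrices, which comes out as 3/4 times the identity in these units.

```python
hbar = 1.0  # assumption: units with hbar = 1
half = hbar / 2

S_x = [[0, half], [half, 0]]
S_y = [[0, -1j * half], [1j * half, 0]]
S_z = [[half, 0], [0, -half]]


def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def add(a, b):
    """Entry-wise sum of two 2x2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def scaled(number, a):
    """Multiply every entry of a 2x2 matrix by a number."""
    return [[number * a[i][j] for j in range(2)] for i in range(2)]


def commutator(a, b):
    """[a, b] = ab - ba."""
    return add(matmul(a, b), scaled(-1, matmul(b, a)))


def close(a, b, tol=1e-12):
    """True if two 2x2 matrices agree entry by entry within tol."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


# [S_x, S_y] = i hbar S_z and the two cyclic permutations
print(close(commutator(S_x, S_y), scaled(1j * hbar, S_z)))
print(close(commutator(S_y, S_z), scaled(1j * hbar, S_x)))
print(close(commutator(S_z, S_x), scaled(1j * hbar, S_y)))

# Sum of squares of the three components
S_squared = add(add(matmul(S_x, S_x), matmul(S_y, S_y)), matmul(S_z, S_z))
print(S_squared)
```

The three comparisons should all be true, and the last line should show a multiple of the identity matrix.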

In analogy with the total angular momentum operator $L^2$, we can also write out the
operator $S^2$:

$$S^2 = \mathbf{S} \cdot \mathbf{S} = \sum_i S_i S_i = \frac{3\hbar^2}{4} \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = \hbar^2 \, \frac{1}{2}\left(\frac{1}{2} + 1\right) \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \tag{10.43}$$

So every spin state $|s\rangle$ is an eigenvector of $S^2$, with eigenvalue $\hbar^2 \frac{1}{2}\left(\frac{1}{2} + 1\right)$:

$$S^2|s\rangle = \hbar^2 \, \frac{1}{2}\left(\frac{1}{2} + 1\right)|s\rangle. \tag{10.44}$$

The reason to write 3/4 in a funny way is to make clear the (formal) resemblance to the
atomic energy states, where we found

$$L^2|n, \ell, m\rangle = \hbar^2 \ell(\ell + 1)|n, \ell, m\rangle. \tag{10.45}$$

So just like we labeled $|n, \ell, m\rangle$ with an orbital angular momentum quantum number $\ell$, spin
states can be interpreted as carrying an intrinsic angular momentum quantum number
1/2. Because this value is the same for all spin states (all spin states are eigenstates of
$S^2$ with the same eigenvalue) this is an intrinsic property of the particle itself. People say
that the electron is a spin-1/2 particle.

10.3.3 Quantum Larmor precession

Ah, we have arrived at our final computation. We start out with a small question. The
operators $S_x$, $S_y$, and $S_z$ describe the spin of a particle along three main axes. What
about all other directions? In spherical coordinates, every direction can be described by
a unit vector $\mathbf{u} = (\sin\theta\cos\varphi, \sin\theta\sin\varphi, \cos\theta)$. Hence it is natural to define the operator
of the spin along direction $\mathbf{u}$ as:

$$S_u = \mathbf{u} \cdot \mathbf{S} = \frac{\hbar}{2} \begin{pmatrix} \cos\theta & \sin\theta \, e^{-i\varphi} \\ \sin\theta \, e^{i\varphi} & -\cos\theta \end{pmatrix} \tag{10.46}$$

A trivial example: the x-direction is given by angles $\theta = \pi/2$, $\varphi = 0$. This corresponds to
the unit vector $\mathbf{u}_x = (1, 0, 0)$, and the associated operator written out above is nothing
but $S_x$. For a general direction $\mathbf{u}$ and the corresponding operator $S_u$, the spin eigenstates
along that direction are given by

$$|+\rangle_u = \begin{pmatrix} e^{-i\varphi/2}\cos(\theta/2) \\ e^{i\varphi/2}\sin(\theta/2) \end{pmatrix} \quad \text{and} \quad |-\rangle_u = \begin{pmatrix} -e^{-i\varphi/2}\sin(\theta/2) \\ e^{i\varphi/2}\cos(\theta/2) \end{pmatrix} \tag{10.47}$$

Again, a trivial check: for the z-direction, $\theta = \varphi = 0$, and the above expressions give
$|+\rangle_z = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and $|-\rangle_z = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$, as should be. We have now all necessary tools to
describe the quantum mechanical version of Larmor precession. Let us take a state which
is spin up along a direction $\mathbf{u}$:

$$|\psi(t = 0)\rangle = |+\rangle_u \tag{10.48}$$

$$= \begin{pmatrix} e^{-i\varphi/2}\cos(\theta/2) \\ e^{i\varphi/2}\sin(\theta/2) \end{pmatrix} \tag{10.49}$$

$$= e^{-i\varphi/2}\cos(\theta/2)|+\rangle_z + e^{i\varphi/2}\sin(\theta/2)|-\rangle_z \tag{10.50}$$
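
The eigenstates along a general direction can be checked against the operator for that direction without any algebra. In the sketch below, the operator is hbar/2 times the matrix with rows (cos θ, sin θ e^{-iφ}) and (sin θ e^{iφ}, -cos θ), and the up and down states are (e^{-iφ/2} cos(θ/2), e^{iφ/2} sin(θ/2)) and (-e^{-iφ/2} sin(θ/2), e^{iφ/2} cos(θ/2)). The function names, the example angles and the choice hbar = 1 are assumptions made for the illustration.

```python
import cmath
import math

hbar = 1.0  # assumption: units with hbar = 1


def spin_operator(theta, phi):
    """Spin operator along the direction with polar angle theta, azimuth phi."""
    h = hbar / 2
    return [
        [h * math.cos(theta), h * math.sin(theta) * cmath.exp(-1j * phi)],
        [h * math.sin(theta) * cmath.exp(1j * phi), -h * math.cos(theta)],
    ]


def plus_state(theta, phi):
    """Spin up along the direction (theta, phi)."""
    return [
        cmath.exp(-1j * phi / 2) * math.cos(theta / 2),
        cmath.exp(1j * phi / 2) * math.sin(theta / 2),
    ]


def minus_state(theta, phi):
    """Spin down along the direction (theta, phi)."""
    return [
        -cmath.exp(-1j * phi / 2) * math.sin(theta / 2),
        cmath.exp(1j * phi / 2) * math.cos(theta / 2),
    ]


def apply(matrix, vector):
    """Multiply a 2x2 matrix with a 2-component column vector."""
    return [sum(matrix[row][k] * vector[k] for k in range(2)) for row in range(2)]


def is_eigenstate(matrix, vector, eigenvalue, tol=1e-12):
    """True if matrix * vector equals eigenvalue * vector within tol."""
    image = apply(matrix, vector)
    return all(abs(image[k] - eigenvalue * vector[k]) < tol for k in range(2))


theta, phi = 0.7, 1.9  # an arbitrary direction, chosen only for illustration
S_u = spin_operator(theta, phi)
print(is_eigenstate(S_u, plus_state(theta, phi), hbar / 2))
print(is_eigenstate(S_u, minus_state(theta, phi), -hbar / 2))
```

Trying other angles, including θ = π/2, φ = 0 for the x-direction, is a quick way to convince yourself that the formulas hold for every direction.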



We now want to study how this state evolves through time, given that we put on a magnetic
field $(0, 0, B)$. Well, from (10.34) the magnetic field gives an energy $E_+ = -\gamma_e B \frac{\hbar}{2}$
to the up state and $E_- = \gamma_e B \frac{\hbar}{2}$ to the down state. The Schrödinger equation
$i\hbar \, \partial_t|\psi\rangle = H|\psi\rangle$ then tells us that the time evolution of the above state is given by

$$|\psi(t)\rangle = e^{-i\varphi/2}\cos(\theta/2) \, e^{-iE_+ t/\hbar}|+\rangle_z + e^{i\varphi/2}\sin(\theta/2) \, e^{-iE_- t/\hbar}|-\rangle_z. \tag{10.51}$$

From this, we can compute the expectation value of the different spin components:

$$\langle S_x \rangle_{\psi(t)} = \langle\psi(t)|S_x|\psi(t)\rangle = \frac{\hbar}{2}\sin\theta\cos(\varphi + \omega_0 t) \tag{10.52}$$

$$\langle S_y \rangle_{\psi(t)} = \langle\psi(t)|S_y|\psi(t)\rangle = \frac{\hbar}{2}\sin\theta\sin(\varphi + \omega_0 t) \tag{10.53}$$

$$\langle S_z \rangle_{\psi(t)} = \langle\psi(t)|S_z|\psi(t)\rangle = \frac{\hbar}{2}\cos\theta \tag{10.54}$$

where $\omega_0 = (E_+ - E_-)/\hbar = -\gamma_e B$, whose magnitude $|\gamma_e B|$ is nothing but the classical Larmor frequency. The above
result is very nice: if you look back, you see that the expectation values of the spin components
have the same time dependence as the components of the classical counterpart.
Such similarities show up quite often in quantum mechanics. Recall for example how the
(semi-)classical radius of the electron orbit (the Bohr radius) was quite a good indicator
for the size of the correct quantum wave function. In general, classical intuition can often
'predict' the evolution of expectation values of systems, and orders of magnitude. But this
is only a guide. Equally often, the classical intuition is no good at all - think of the
tunneling effect for example.

Figure 10.8: The spin of a particle does not point. Even though the expectation values
define a vector (which does point) one would draw several incorrect conclusions from
treating spin as a vector. The most obvious pitfall is concluding that $S_x|+\rangle_z = 0$ by
projecting the spin on the x-axis: this is of course wrong. The right decomposition is
shown on the right. Clearly this is not the way ordinary vectors work.
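
The precession of the expectation values can also be seen by evolving the state numerically. The sketch below assumes units with hbar = 1, the Hamiltonian H = -γ_e B S_z from the Zeeman discussion, and a made-up value for the product γ_e B; all names are hypothetical. It only illustrates three robust features: the z-component stays fixed, the transverse components keep a constant length (hbar/2) sin θ, and the motion repeats after one Larmor period. The sense of rotation depends on the sign of γ_e B.

```python
import cmath
import math

hbar = 1.0          # assumption: units with hbar = 1
omega_larmor = 2.0  # hypothetical value of gamma_e * B in these units


def state_at(t, theta, phi):
    """Coefficients of |+>_z and |->_z at time t for H = -gamma_e B S_z."""
    e_up = -hbar * omega_larmor / 2
    e_down = hbar * omega_larmor / 2
    c_up = cmath.exp(-1j * phi / 2) * math.cos(theta / 2) * cmath.exp(-1j * e_up * t / hbar)
    c_down = cmath.exp(1j * phi / 2) * math.sin(theta / 2) * cmath.exp(-1j * e_down * t / hbar)
    return c_up, c_down


def expectation_values(t, theta, phi):
    """Expectation values of S_x, S_y and S_z in the state at time t."""
    c_up, c_down = state_at(t, theta, phi)
    overlap = c_up.conjugate() * c_down
    s_x = hbar * overlap.real
    s_y = hbar * overlap.imag
    s_z = hbar / 2 * (abs(c_up) ** 2 - abs(c_down) ** 2)
    return s_x, s_y, s_z


theta, phi = 0.7, 1.9  # arbitrary initial direction, for illustration only
period = 2 * math.pi / omega_larmor

for t in (0.0, period / 4, period / 2, period):
    s_x, s_y, s_z = expectation_values(t, theta, phi)
    print(round(t, 3), round(s_x, 4), round(s_y, 4), round(s_z, 4))
```

The printed rows at t = 0 and at t equal to one period agree, while the last column never changes.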

Also with spin, one should be careful. The above result on Larmor precession may
seem so natural that one would almost think that the naive picture of spin (an electron
rotating around an axis) captures the situation accurately. Although this would make the
concept of spin easier to grasp, this idea really is incorrect. Spin is a bit more tricky
than that. Take a look at Figure 10.8. Say there is an electron with spin up along axis z.
You may be tempted to represent this as an arrow pointing upwards along z. (And such
is done quite often.) This is not bad, but some students tend to infer from this drawing
that the components of the spin along axis x are zero. So they might write

$$S_x|+\rangle_z = 0. \quad \text{(WRONG)} \tag{10.55}$$

This is very wrong: we have seen in this chapter that the spin state, expressed in the
basis of the x direction, will be a superposition:

$$|+\rangle_z = \frac{1}{\sqrt{2}}|+\rangle_x + \frac{1}{\sqrt{2}}|-\rangle_x \tag{10.56}$$

so that

$$S_x|+\rangle_z = \frac{\hbar}{2}\frac{1}{\sqrt{2}}|+\rangle_x + \left(-\frac{\hbar}{2}\right)\frac{1}{\sqrt{2}}|-\rangle_x \neq 0 \tag{10.57}$$

So if you really want to use arrows, make sure you understand the right side of Figure 10.8.
But even then, you might get into trouble when trying to draw complex spin components -
there, probably, all intuition breaks down. The safest thing is to remember the following
(not so catchy) catchphrase:

The spin of a particle is represented by two (possibly complex) numbers. You
can decompose this as a linear combination of a spin-up and a spin-down state,
with respect to any direction. In particular, if the state is up (or down) along
a direction $\mathbf{u}$, then it is an eigenstate of the operator $S_u$ with eigenvalue $\hbar/2$ (or
$-\hbar/2$). However, this does not mean this spin points up (or down) along $\mathbf{u}$.
Spin does not point in any direction; it is a state - not a vector.

Especially the last sentence is important. You might object: don't the expectation
values of $\mathbf{S}$ define a vector for each state, like in equation (10.54)? This thing points,
right? Well, that is very true, but then recall that a state is more than just its expectation
value. In a similar way, we can associate to each wave function a well-defined number $\langle x \rangle$,
but this does not yet mean a particle is a point.

So this makes spin a bit subtle to understand. There is a pretty common saying that
there is nothing we understand in life - we just get used to things. That is probably true
here too - and don't worry, you will get used to the idea of spin at some point.

Application: Sunspots 

Due to the high temperature, many of the atoms in the solar atmosphere are excited or
even (partly) ionized. As a result, photons are abundantly absorbed and emitted
by electrons that are kicked up or fall down from one level to another. This is visible in the
light that reaches us: the spectrum has so-called spectral lines (less intensity at specific
frequencies). It is possible to identify which lines are due to which elements and hence
deduce the composition of the sun without actually having to go there. (Historically, this
is how we got to know the sun is composed of mainly hydrogen and helium. Before that, it
was thought the sun had a composition similar to earth.) However, if you look at those
spectral lines carefully, you can see that the degenerate lines (coming from transitions
between degenerate energy levels) are actually split! With what you have learned, you
would guess that the sun has a magnetic field, and that the splitting you are seeing is
actually the Zeeman effect. This is indeed true. By doing precise measurements, you can
observe that this magnetic field varies along the solar surface. Maybe you have once heard
of the small but numerous dark spots on the sun (not so imaginatively called 'sunspots').
It turns out the magnetic field is significantly larger at these places. This is crucial:
the darkness of the spots turns out to be caused by the magnetic field, which influences
the dynamics of the gas and plasma around it in such a way that the temperature is lowered,
which in turn makes that region shine less brightly. The exact dynamics of the magnetic
field of the sun and the formation of sunspots is a vast field of research, and the underlying
experimental observations would definitely not be possible without (an understanding of)
the Zeeman effect!

Exercises

1. Explain to yourself (without looking at the text) how the Zeeman and Stern-Gerlach
experiments work and why they lead to the idea of spin.

2. In the chapter on the uncertainty principle, you learned that non-commuting observables
are incompatible. How is this related to spin and the 'twisted' version of
the Stern-Gerlach experiment (Figure 10.7)?

3. Check that the spin operators satisfy the angular momentum
commutation relations. Are these still satisfied if you absorb an extra
factor - say multiply all three matrices by 2? (Now look again at the transition
(10.19) → (10.20). Could you also have taken $\gamma_e$ only half as large, and $S_z = \pm\hbar$?)

4. Check the completeness relations for the basis $\{|+\rangle_u, |-\rangle_u\}$.

5. Find the eigenvalues and eigenvectors of the matrix (10.46). Do you find the vectors
stated in the text?

6. A typical value of the B-field in a solar spot is 0.15 Tesla. Ignoring orbital angular momentum,
what is the energy difference between the two spin states of an electron of an atom near
that spot? (The size of the gyromagnetic ratio of the electron spin is $\gamma_e = 1.76 \cdot 10^{11}\ \mathrm{s^{-1}\,T^{-1}}$.)
What is the frequency of the emitted photon when the electron goes from
one spin state to another?

7. Check the following property of the Pauli matrices: $\sigma_a \sigma_b = \delta_{ab} I + i \sum_c \epsilon_{abc} \sigma_c$, where
$I$ is the 2x2 identity matrix. With this, prove $(\mathbf{a} \cdot \boldsymbol{\sigma})(\mathbf{b} \cdot \boldsymbol{\sigma}) = (\mathbf{a} \cdot \mathbf{b}) I + i \boldsymbol{\sigma} \cdot (\mathbf{a} \times \mathbf{b})$.

8. It is natural to guess that the total angular momentum of an electron is given by
$\mathbf{J} = \mathbf{L} + \mathbf{S}$: just the sum of its orbital and intrinsic angular momentum. Because
$\mathbf{L}$ works on the wave function, and $\mathbf{S}$ on the spin state of the electron, $[\mathbf{S}, \mathbf{L}] = 0$.
Show that this implies that $\mathbf{J}$ is indeed an angular momentum operator (i.e. that it
satisfies the right commutation relations).

9. Given a set of operators $J_x, J_y, J_z$ which satisfy the angular momentum commutation
relations, define the following two operators:

$J_+ = J_x + iJ_y$ and $J_- = J_x - iJ_y$.

Show that $[J_z, J_\pm] = \pm\hbar J_\pm$. Show also $J_z J_\pm|m\rangle = \hbar(m \pm 1) J_\pm|m\rangle$ if $|m\rangle$ is an
eigenstate of $J_z$ with eigenvalue $\hbar m$. Interpret this result. (Hint: think back to the
situation with the creation and annihilation operators for the harmonic oscillator.)



Chapter 11 

Many particles 



In this chapter. . . 

We are about to explore a topic left untouched so far: describing the quantum behaviour
of collections of particles, instead of just a single one. An important concept governing the
dynamics of such systems is the notion of identical particles. This will lead to a division of
particles into two types: bosons and fermions. These concepts go a long way towards understanding
the structure of larger systems. It will soon become clear that they have very direct
consequences on the touch and feel of our own (macroscopic) world.

11.1 Product and sum of vector spaces

We will give a short review of the direct sum and direct product of vector spaces. You
might have seen these before, but for what follows, it's useful to have it all fresh in your
mind.

As an ode to those golden times of high school math, we will illustrate the two concepts
with an example involving marbles - oh yes. Imagine you have a collection of marbles.
They are of three kinds: yellow, blue and red. You can summarize the content of your
collection with three numbers: $(n_y, n_b, n_r)$. These quantities give the total number of
yellow, blue and red marbles you have. The space of all possible $(n_y, n_b, n_r)$ forms a vector
space. (We ignore the issue of these numbers being integer or not.) For example: (1, 2, 2)
and (2, 4, 1) give two possible marble collections, but also their sum (3, 6, 3) describes a
possible collection. The same goes for multiples. Imagine you are even more lucky - and not
only have marbles, but also three empty jars. Woah, full of enthusiasm you put your
marbles in the three jars. That sure is some good fun, but a sudden problem strikes you.
How can I now describe my collection, now that it is in the jars? Here is a first way to
do so. On top of specifying $(n_y, n_b, n_r)$, you can give another vector, which gives the total
weight of marbles in each jar: $(w_1, w_2, w_3)$. These numbers too form a vector space, by
very similar arguments. Better even, your collection (and how it is put in the jars) can be
compactly summarised by one long vector:

$$(n_y, n_b, n_r, w_1, w_2, w_3). \tag{11.1}$$

This just glues the two pieces of information together in one single object. That is possible 
because the two pieces of information are completely independent: the number of marbles 
per color and the weight of the marbles in each jar have nothing to do with each other. 
Making a new vector space in this fashion, is called the direct sum of vector spaces. 

Being satisfied with this answer, you are just about to go and play with the marbles
and jars, but another issue strikes you. You could also describe the marbles in the jars
in another way. Why not say how much marble weight there is in each jar, per color? So
something like:

$$\begin{pmatrix} w_{1,y} & w_{1,b} & w_{1,r} \\ w_{2,y} & w_{2,b} & w_{2,r} \\ w_{3,y} & w_{3,b} & w_{3,r} \end{pmatrix} \tag{11.2}$$

with for example $w_{2,y}$ the total weight of yellow marbles in jar 2. In this alternative
description, we get more information on the system. This is because we are not keeping
track of the weights in jars and the types of marbles separately anymore: we give mixed
information on how much of each type is in each jar. This new vector space (in the form
of 3-by-3 matrices) is called the direct product of the original two vector spaces.

What you should remember from this somewhat pathetic example is the following.
Every pair of vector spaces can be combined into a new (larger) one. There are two ways
to do this. The first (direct sum) glues vectors together. It means that you keep track
of both vectors (marble types and jar weights) in a rather trivial way. These two bits of
information need not have any relation at all. The second possibility is taking the direct
product. In that case, you keep mixed information (here: how much of each of the marble
types is in each of the jars). This construction is suited for situations in which you want to
keep track of a finer sort of information.

In a slightly more mathematical language: let there be given two vector spaces $V$ and
$W$ of dimension $n$ and $m$, with bases $v_1, \ldots, v_n$ and $w_1, \ldots, w_m$. The direct sum of these
vector spaces is another vector space, denoted $V \oplus W$, with basis

$$\{v_1, \ldots, v_n, w_1, \ldots, w_m\} \tag{11.3}$$

Obviously, this vector space has dimension $n + m$. Next to this, we can also construct the
direct product, denoted $V \otimes W$. This space has as a formal basis:

$$\{v_1 w_1, v_1 w_2, \ldots, v_1 w_m, \; \ldots, \; v_n w_1, v_n w_2, \ldots, v_n w_m\} \tag{11.4}$$

and has dimension $nm$.
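
The difference between the two constructions shows up most clearly when counting components. The sketch below uses plain Python lists; the marble counts and jar weights are invented numbers and the function names are hypothetical. Note that `direct_product` only produces the special elements built from one vector of each space; a general element of the product space is any n-by-m array of numbers.

```python
def direct_sum(v, w):
    """Glue the components of v and w together into one long vector."""
    return list(v) + list(w)


def direct_product(v, w):
    """Table of all products v_i * w_j, one row per component of v."""
    return [[v_i * w_j for w_j in w] for v_i in v]


# Hypothetical data: marbles per colour, and weight per jar.
colours = [3, 1, 2]
jars = [0.5, 0.2, 0.3]

glued = direct_sum(colours, jars)
table = direct_product(colours, jars)

print(glued, "has", len(glued), "components (n + m)")
print(table, "has", sum(len(row) for row in table), "components (n * m)")
```

For two three-dimensional spaces the sum has six components and the product nine, matching the dimensions n + m and nm.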

11.2 Many particles

So here we start our quest to understand systems with more than one particle. For
concreteness, consider two particles. How can we describe those two together? Since both
of them are wavepackets ($\psi_1$ and $\psi_2$), we could guess that the total system should be
described by the sum of the two wavefunctions $\psi_1 + \psi_2$. Something like this is suggested
in Figure 11.1. However, such a description quickly runs into trouble: how could we then
distinguish between those two particles and just one particle of which the wavefunction
just happens to consist of two lumps? We need something else. A good starting point is
the following question: what if we measure their positions? In that case we get a concrete
outcome like $(x_1, x_2)$. Here $x_1$ is the measured position of one particle, and $x_2$ the detected
position of the other. From our experience with single particle systems, we can guess that
such an outcome is not certain, but only has a probability associated to it. From this we
guess there must be a function

$$\psi(x_1, x_2) \tag{11.5}$$

such that $|\psi(x_1, x_2)|^2$ is the probability density to measure the particles' positions to
be $(x_1, x_2)$. This turns out to be the right description. So when we correctly want to
describe two particles at once, we should not consider two separate wavefunctions, but one
collective wavefunction. As an illustration: the probability to measure $a_1 < x_1 < b_1$
and simultaneously $a_2 < x_2 < b_2$ is given by

$$P(a_1 < x_1 < b_1 \;\&\; a_2 < x_2 < b_2) = \int_{x_1 = a_1}^{x_1 = b_1} \int_{x_2 = a_2}^{x_2 = b_2} |\psi(x_1, x_2)|^2 \, dx_1 \, dx_2 \tag{11.6}$$

Figure 11.1: Left: if the wavefunction $\psi_{12}$ of two particles would be just the sum of
the wavefunctions, such a state could not be distinguished from a single particle whose
wavefunction happens to consist of two lumps. Right: if we describe the two-particle system
with a function $\psi_{12} = \psi(x_1, x_2)$ (real part shown) this problem does not occur. So this
gives the right description.



Hence, the normalisation of a collective wavefunction takes the form

$$\int_{-\infty}^{\infty} \int_{-\infty}^{\infty} |\psi(x_1, x_2)|^2 \, dx_1 \, dx_2 = 1 \tag{11.7}$$

Because a collective wavefunction has more arguments (here two: $x_1$ and $x_2$) it is somewhat
harder to visualise than a single wavefunction. An attempt at drawing a particular
$\psi(x_1, x_2)$ is made in Figure 11.1 on the right.
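
The probability and normalisation integrals for a collective wavefunction can be estimated on a grid. The sketch below is only an illustration under explicit assumptions: the wavefunction is a made-up product of two Gaussians (chosen because its integrals are known exactly), the infinite integration range is cut off at ±8, and a simple midpoint rule is used.

```python
import math


def psi(x1, x2):
    """Hypothetical collective wavefunction: a product of two Gaussians."""
    return math.exp(-(x1 ** 2 + x2 ** 2) / 2) / math.sqrt(math.pi)


def probability(a1, b1, a2, b2, steps=200):
    """Midpoint-rule estimate of the double integral of |psi|^2 over a box."""
    h1 = (b1 - a1) / steps
    h2 = (b2 - a2) / steps
    total = 0.0
    for i in range(steps):
        x1 = a1 + (i + 0.5) * h1
        for j in range(steps):
            x2 = a2 + (j + 0.5) * h2
            total += abs(psi(x1, x2)) ** 2
    return total * h1 * h2


# Normalisation: the whole plane, truncated to a box where psi is non-negligible.
norm = probability(-8, 8, -8, 8)

# Probability that both particles are found at positive positions.
quadrant = probability(0, 8, 0, 8)

print(norm, quadrant)
```

For this particular wavefunction the first number should be very close to 1 and the second very close to 1/4, since each particle is independently found at positive x half of the time.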

11.2.1 Direct product 

We have just seen that the wavefunction of a system of two particles keeps track of both
particles at once, and is not just the conjunction of two individual wavefunctions. If you got the
marble story, you might smell a direct product coming up here. Say that a basis of the
Hilbert space of particle one is given by $|\chi_1\rangle, \ldots, |\chi_n\rangle$, and a basis for the states of the
second particle by $|\phi_1\rangle, \ldots, |\phi_m\rangle$. Then the combined system of the two particles has as
basis

$$|\chi_i\rangle|\phi_j\rangle \quad \text{with } 1 \le i \le n,\ 1 \le j \le m. \tag{11.8}$$

The state $|\chi_i\rangle|\phi_j\rangle$ is understood as a situation in which the first particle is in state $|\chi_i\rangle$
and the second is in $|\phi_j\rangle$. The inner product between two collective states is defined in
the following natural way:

$$\big(\langle\chi_i|\langle\phi_j|\big)\big(|\chi_{i'}\rangle|\phi_{j'}\rangle\big) = \langle\chi_i|\chi_{i'}\rangle \, \langle\phi_j|\phi_{j'}\rangle \tag{11.9}$$

But this is not enough: we need the combined states to form a Hilbert space. So we
also want to allow linear combinations of the above basis states. This means the most
general state of the two-particle system can be written as

$$|\psi\rangle = \sum_{ij} c_{ij} \, |\chi_i\rangle|\phi_j\rangle \tag{11.10}$$

(By $\sum_{ij}$ we mean $\sum_{i=1}^{n} \sum_{j=1}^{m}$ of course.) For this $|\psi\rangle$, the chance to measure the first
particle to be in state $|\chi_i\rangle$ and the second in $|\phi_j\rangle$ is given by

$$P(1 : |\chi_i\rangle \;\&\; 2 : |\phi_j\rangle) = |c_{ij}|^2 \tag{11.11}$$

Hence $|\psi\rangle$ is normalised if

$$\sum_{ij} |c_{ij}|^2 = 1. \tag{11.12}$$
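The bookkeeping of (11.9) to (11.12) can be tried out on a small example. The sketch below stores a two-particle state as its table of coefficients $c_{ij}$ (a nested list), assuming orthonormal single-particle bases. The function names and the sample numbers are illustrative choices, not notation from the course.

```python
import math

def normalise(c):
    """Rescale the coefficients so that sum_ij |c_ij|^2 = 1, as in (11.12)."""
    norm = math.sqrt(sum(abs(cij) ** 2 for row in c for cij in row))
    return [[cij / norm for cij in row] for row in c]

def joint_probability(c, i, j):
    """P(1 : chi_i & 2 : phi_j) = |c_ij|^2, as in (11.11)."""
    return abs(c[i][j]) ** 2

def marginal_first(c, i):
    """Probability to find particle 1 in chi_i, whatever particle 2 does."""
    return sum(abs(cij) ** 2 for cij in c[i])

def product_state(a, b):
    """Coefficients c_ij = a_i * b_j of (sum_i a_i |chi_i>)(sum_j b_j |phi_j>)."""
    return [[ai * bj for bj in b] for ai in a]

def inner(c, d):
    """Inner product <c|d> of two collective states, following (11.9)."""
    return sum(cij.conjugate() * dij
               for row_c, row_d in zip(c, d)
               for cij, dij in zip(row_c, row_d))

# Example with n = 2 states for particle 1 and m = 3 states for particle 2.
c = normalise([[1, 2j, 0], [1, -1, 1 + 1j]])
total = sum(joint_probability(c, i, j) for i in range(2) for j in range(3))
```

For product states the inner product factorises into the two single-particle inner products, which is exactly what (11.9) demands.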

We can now better understand the collective wavefunction $\psi(x_1, x_2)$ from above. A (continuous)
basis we can take for any particle is the position basis $|x\rangle$. Hence, we can take
as a basis for the collective states all products $|x_1\rangle|x_2\rangle$. These are wavefunctions sharply
peaked at one specific $x_1$ and $x_2$. So just like

$$|\psi\rangle = \int dx \; \psi(x) \, |x\rangle \tag{11.13}$$

for a single particle, we have

$$|\psi\rangle = \int\!\!\int dx_1 \, dx_2 \; \psi(x_1, x_2) \, |x_1\rangle|x_2\rangle \tag{11.14}$$

for two particles. So the collective wavefunction we were talking about before is just the
position representation of the combined system:

$$\psi(x_1, x_2) = \big(\langle x_1|\langle x_2|\big) |\psi\rangle \tag{11.15}$$

This means we can also obtain (11.14) by inserting two completeness relations:

$$|\psi\rangle = \left( \int dx_1 \, |x_1\rangle\langle x_1| \right) \left( \int dx_2 \, |x_2\rangle\langle x_2| \right) |\psi\rangle
= \int\!\!\int dx_1 \, dx_2 \; \psi(x_1, x_2) \, |x_1\rangle|x_2\rangle \tag{11.16}$$

Before moving on, we want to point out a classic pitfall. Although all basis states are
products, not every state can be written in the product form $|\psi\rangle = |\chi\rangle|\phi\rangle$. We illustrate
this subtlety with an example. Consider two particles, each of which can be in states $|a\rangle$
and $|b\rangle$. For example, having both particles in state $a$ is written as

$$|a\rangle_1 |a\rangle_2 \tag{11.17}$$

with the subscripts denoting that we are talking about particle 1 and 2 respectively. In the same
way, the system can be in state

$$|a\rangle_1 |b\rangle_2 \tag{11.18}$$

The system could also be in a (normalised) superposition of the above two:

$$\frac{1}{\sqrt{2}} |a\rangle_1 |a\rangle_2 + \frac{1}{\sqrt{2}} |a\rangle_1 |b\rangle_2 = |a\rangle_1 \left( \frac{1}{\sqrt{2}} |a\rangle_2 + \frac{1}{\sqrt{2}} |b\rangle_2 \right) \tag{11.19}$$

This is a product form again! Indeed, if we rename the state $(|a\rangle + |b\rangle)/\sqrt{2}$ as $|c\rangle$, the
above state is just $|a\rangle_1 |c\rangle_2$. In contrast, you can show that the state

$$\frac{1}{\sqrt{2}} |a\rangle_1 |a\rangle_2 + \frac{1}{\sqrt{2}} |b\rangle_1 |b\rangle_2 \tag{11.20}$$

can never be rewritten as a product $|d\rangle_1 |e\rangle_2$ - whatever definition of $d$ and $e$ you would try.
So if someone tells you that a system of two particles is in a state $|\psi\rangle$, you can not in general
decompose this state as $|\psi\rangle = |\chi\rangle|\phi\rangle$. The only decomposition that is always possible is
(11.10).
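For two particles with two states each, there is a quick way to test whether a state is a product. Write the state as a 2 x 2 table of coefficients $c_{ij}$ (row index for particle 1, column index for particle 2, both in the order $a$, $b$). A product state has $c_{ij} = d_i e_j$, and for a nonzero 2 x 2 table this is possible exactly when the determinant $c_{aa} c_{bb} - c_{ab} c_{ba}$ vanishes. The sketch below applies this test to the two superpositions of the text; the function name and the tolerance are illustrative assumptions.

```python
import math

def is_product_state(c, tol=1e-12):
    """True if the 2 x 2 coefficient table c can be written as c_ij = d_i * e_j.
    For a nonzero 2 x 2 table this is equivalent to a vanishing determinant."""
    det = c[0][0] * c[1][1] - c[0][1] * c[1][0]
    return abs(det) < tol

s = 1.0 / math.sqrt(2.0)

# (|a>|a> + |a>|b>) / sqrt(2): particle 1 is always in state a.
state_11_19 = [[s, s],
               [0.0, 0.0]]

# (|a>|a> + |b>|b>) / sqrt(2): the states of the two particles are tied together.
state_11_20 = [[s, 0.0],
               [0.0, s]]

print(is_product_state(state_11_19), is_product_state(state_11_20))
```

The determinant test only works for the 2 x 2 case; for larger tables one would check that the table has rank one.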

11.2.2 Identical particles 

In the previous section, we did not at all specify which two particles or systems we were
talking about. They could have been anything. Let us be more specific here, and say that
the two objects under consideration are two particles of the same species: so two protons,
or two electrons, two muons - whatever. You may or may not have thought about this
before, but there is a very special thing about particles. They don't have hair and they
don't wear clothes, so if you see two particles of the same species, they are really identical.
To illustrate this, imagine studying two electrons. The chance to detect electron 1 around
some $x$ and electron 2 around $x'$ is proportional to the probability density:

$$P(1 : x \;\&\; 2 : x') \propto |\psi(x, x')|^2 \tag{11.21}$$

In a similar fashion, you would say that the chance to find electron 1 around $x'$ and electron
2 around $x$ is

$$P(1 : x' \;\&\; 2 : x) \propto |\psi(x', x)|^2 \tag{11.22}$$

This could a priori be two different probabilities. However, if you think more critically,
you may remark that identical particles are necessarily indistinguishable. If you detect
one electron at site $x$ and another at $x'$, how would you say which ones they are? The
names that you gave them (electron 1 and electron 2) are not written on them, so if you
do measurements, you can never tell which detection corresponded to which electron. So
purely on the basis of consistency, we need to have

$$|\psi(x, x')|^2 = |\psi(x', x)|^2 \quad \text{for identical particles.} \tag{11.23}$$

Indeed, both quantities describe one and the same measurement outcome (an electron
found at $x$ and an electron found at $x'$) so they have to yield equal probabilities. So we
conclude

$$\psi(x, x') = \eta \, \psi(x', x) \tag{11.24}$$

with $\eta$ a complex number of modulus one. In fact, by interchanging twice, we get the same
state again, so $\eta^2 = 1$, implying $\eta = \pm 1$. Inspired by this result, we define the exchange
operator $P$. Take again two identical particles. If particle 1 is in state $|\chi\rangle$ and particle
2 in state $|\phi\rangle$, then $P$ acts as follows:

$$P \, |\chi\rangle_1 |\phi\rangle_2 = |\phi\rangle_1 |\chi\rangle_2 \tag{11.25}$$

That is, $P$ puts particle 1 in the state of particle 2 and vice versa. Obviously, $P^2 = 1$: if
you interchange the states of the particles twice the state is again as before. From $P^2 = 1$,
it follows that $P$ can only have eigenvalues $\pm 1$. (Indeed, if $P|\psi\rangle = \lambda|\psi\rangle$ and $P^2 = 1$, then
$|\psi\rangle = P^2|\psi\rangle = \lambda^2|\psi\rangle$, so $\lambda^2 = 1$.) If the eigenvalue under $P$ of the pair of identical particles
is $-1$, we call them fermions, and if their eigenvalue is $+1$, we call the particles bosons.
So

$$|\chi\rangle_1 |\phi\rangle_2 = |\phi\rangle_1 |\chi\rangle_2 \quad \text{(bosons)} \tag{11.26}$$

$$|\chi\rangle_1 |\phi\rangle_2 = -|\phi\rangle_1 |\chi\rangle_2 \quad \text{(fermions)} \tag{11.27}$$

This is a very fundamental classification of elementary particles. Electrons, neutrons
and protons are all fermions. Photons are bosons. As you know, there are many other
particles, and all of these are either bosons or fermions. Let us consider fermions as an
example. The two-particle state is:

$$|\psi\rangle = \int\!\!\int dx \, dx' \; \psi(x, x') \, |x\rangle_1 |x'\rangle_2 \tag{11.28}$$

and since $|x\rangle_1 |x'\rangle_2 = -|x'\rangle_1 |x\rangle_2$, only the antisymmetric part of $\psi$ contributes to the
integral. So without any loss we can keep only the antisymmetric part of $\psi(x, x')$:

$$\psi(x, x') = -\psi(x', x) \tag{11.29}$$

This way, we see that $\eta = -1$ for fermions. In a similar fashion, $\eta = 1$ for bosons and
$\psi(x, x')$ is symmetric in that case. So for either bosons or fermions, the chance to detect
particle 1 here and particle 2 there is equal to the chance to detect particle 2 here and
particle 1 there - as should be for indistinguishable particles.

Figure 11.2: Two particles: one is at location $x_1$ and the other is at location $x_2$. Because
the particles are identical, this can not be distinguished from a situation where we
interchange the particles' positions. This has important consequences for the collective
wavefunction: it should either be symmetric or anti-symmetric in terms of the position
variables.
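The action of the exchange operator is easy to play with in a finite basis. In the sketch below a two-particle state is stored as a square table of coefficients $c_{ij}$ (particle 1 in basis state $i$, particle 2 in basis state $j$), so that exchanging the particles simply transposes the table. This discrete representation and the function names are assumptions made for illustration; the text itself works with continuous positions.

```python
def exchange(c):
    """Exchange operator P: (P c)_ij = c_ji, i.e. the two particles swap states."""
    n = len(c)
    return [[c[j][i] for j in range(n)] for i in range(n)]

def combine(c, d, sign):
    """Entry-wise (c + sign * d) / 2."""
    n = len(c)
    return [[(c[i][j] + sign * d[i][j]) / 2 for j in range(n)] for i in range(n)]

def symmetric_part(c):
    """Part of the state with eigenvalue +1 under P (boson-like)."""
    return combine(c, exchange(c), +1)

def antisymmetric_part(c):
    """Part of the state with eigenvalue -1 under P (fermion-like)."""
    return combine(c, exchange(c), -1)

# An arbitrary (unnormalised) table of coefficients for three basis states.
c = [[1, 2, 0],
     [4, 5, 6],
     [7, 8, 9]]
sym = symmetric_part(c)
anti = antisymmetric_part(c)
```

Applying `exchange` twice returns the original table, `sym` is unchanged by `exchange`, and `anti` picks up a minus sign. Note that the diagonal of `anti` is zero: the antisymmetric part has no amplitude for both particles sitting in the same basis state.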



11.2.3 The Pauli exclusion principle 

So far we have only considered two particles. We will not consider more than two particles
in a general way. That gets a bit messy (expressions with many indices and that sort of visual
clutter) and it gives little extra insight: all rules are just a quite straightforward
extrapolation of the above conclusions. Taking direct products, bosons vs. fermions,
minus signs popping up, that kind of thing. So we skip the general treatment here.

A special case we will treat is a system of $N$ fermions, all of the same kind. If every
fermion separately can be in states $|\psi_i\rangle$, then the Hilbert space of the entire system is
spanned by

$$|\psi_{i_1}\rangle |\psi_{i_2}\rangle \ldots |\psi_{i_N}\rangle \tag{11.30}$$

The above state describes a situation where fermion 1 is in state $|\psi_{i_1}\rangle$, fermion 2 in $|\psi_{i_2}\rangle$,
etcetera. Now by the definition of a fermion, exchanging the states of two fermions gives
minus the original state. So for example, interchanging the states of fermion 1 and 3, we
get:

$$|\psi_{i_1}\rangle |\psi_{i_2}\rangle |\psi_{i_3}\rangle \ldots |\psi_{i_N}\rangle = -|\psi_{i_3}\rangle |\psi_{i_2}\rangle |\psi_{i_1}\rangle \ldots |\psi_{i_N}\rangle \tag{11.31}$$

Up to the sign, the right hand side is to be read as: fermion 1 is in state $i_3$ and fermion 3
in state $i_1$. Once more, note that this rule is not so strange: both above states yield the
same observations (you'd measure indistinguishable electrons in states $i_1, \ldots, i_N$) so they
have to be identical up to a phase factor, which by definition is $-1$ for fermions.
From the above equation, we conclude that there is some redundancy in the states (11.30).
Any two combinations that are a permutation of the individual states have to be
proportional to each other. So many of them are just copies of one another, up to a possible
sign.

But there is a more striking remark. Suppose the system is in a state like (11.30) but
now we put fermions 1 and 2 in the same state, so $i_1 = i_2$. Hence the total system
is $|\psi_{i_1}\rangle |\psi_{i_1}\rangle |\psi_{i_3}\rangle \ldots |\psi_{i_N}\rangle$. If we blindly interchange the states of fermions 1 and 2 (even
though they are the same) the above rule tells us we get a minus sign:

$$|\psi_{i_1}\rangle |\psi_{i_1}\rangle |\psi_{i_3}\rangle \ldots |\psi_{i_N}\rangle = -|\psi_{i_1}\rangle |\psi_{i_1}\rangle |\psi_{i_3}\rangle \ldots |\psi_{i_N}\rangle \tag{11.32}$$

This is a bit of a funny equation: an object can only equal its opposite if it is zero. So a
state with $i_1 = i_2$ can not exist! In general, we conclude:

It is impossible for two fermions (of the same species) to be in the
same quantum state.

This is a very profound statement, with far-reaching consequences (we will meet some
later on in this chapter). It was first formulated by Pauli, and is called the exclusion
principle.
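The exclusion principle can be seen at work in a few lines of code. The sketch below builds the (unnormalised) antisymmetric combination of a product state by summing over all permutations of the single-particle labels, each weighted with the sign of the permutation. The dictionary representation (tuple of labels mapped to an integer amplitude) is an illustrative assumption, not something taken from the course.

```python
from itertools import permutations

def permutation_sign(perm):
    """Sign of a permutation of (0, ..., n-1), found by counting inversions."""
    inversions = sum(1
                     for i in range(len(perm))
                     for j in range(i + 1, len(perm))
                     if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def antisymmetrise(labels):
    """Unnormalised antisymmetric combination of |labels[0]>|labels[1]>...

    Returns a dict {tuple of labels: integer amplitude}; terms whose
    amplitudes cancel are dropped, so an empty dict means the zero vector.
    """
    state = {}
    for perm in permutations(range(len(labels))):
        term = tuple(labels[k] for k in perm)
        state[term] = state.get(term, 0) + permutation_sign(perm)
    return {term: amp for term, amp in state.items() if amp != 0}

two_different = antisymmetrise(["a", "b"])
two_equal = antisymmetrise(["a", "a", "c"])
```

With two different labels the result is $|a\rangle|b\rangle - |b\rangle|a\rangle$. As soon as two labels coincide, every term is cancelled by its partner with the opposite sign and nothing is left.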

In the unlikely case you mix up the meanings of bosons and fermions, you can use a 
trick. A fermion is firm (it 'repels' other fermions) while a boson likes to get close on 
(it likes to sit together with other bosons). OK, there are better sounding rhymes in the 
world but well. Time to use all these new goodies for some applications. 

11.3 Applications

With the ideas and techniques presented in the theory part, you can say a great deal
about larger physical systems. By 'large' we mean: an atom with two or more electrons, a
molecule, or even a large piece of metal. The Pauli exclusion principle is the final piece of
the quantum mechanical jigsaw that we have been putting together in this course. Below
are some nice applications.

11.3.1 The atomic structure 

You know that the energy eigenstates of a single electron in an atom are given by $|n \ell m\rangle$.
In the absence of a magnetic field, the energy depends on $n$ only. If an electron is in
a high energy state $n > 1$, it will typically fall down after a while, until it has reached
$n = 1$. With each transition, the energy difference is released in the form of radiation. So
a hydrogen atom at rest will typically be in the $n = 1$ state, the ground state. Now what
about larger atoms? These have more electrons in them, and the nuclear charge is now
$Z \neq 1$, but it turns out you can still find states of the form $|n, \ell, m\rangle$. A bold guess would
be that in such systems all electrons would sit in the lowest energy state. So you
would write

$$|\psi\rangle = |100\rangle_1 \ldots |100\rangle_N \quad \text{(WRONG)} \tag{11.33}$$

In words: the system is given by all ($N$) electrons sitting in the lowest energy state $|100\rangle$.
However, this is not what Pauli taught us to do. No two electrons can sit in the same
state, so the above configuration is just not possible. If no two electrons can sit in the same
state, what is the best we can do (energy-wise)? Ah, just filling up the $N$ lowest energy
states of course. Also, we know that electrons have a spin degree of freedom. Hence, in
each energy level, there can be a spin-up electron and a spin-down one. As an example of this,
consider the lithium atom. Two electrons can sit in the $|100\rangle$ state, provided they have
opposite spins - this way they are in different states, and the exclusion principle poses
no problem. The third electron however is the odd one out: we can not do anything but
put it in a higher energy level. So the ground state of the 3-electron system of lithium
is given by

$$|\psi\rangle_{\rm Li} = |100\uparrow\rangle_1 \, |100\downarrow\rangle_2 \, |200\,s\rangle_3 \tag{11.34}$$

Here, we have just put the spin states as an extra parameter in the list of quantum
numbers. Note that the spin state $s$ of the third electron can be whatever it likes to be
- any superposition of up and down is allowed. Also, the spin up and down (of the first
two electrons) can be with respect to any direction - as long as they are opposites, they
describe states without overlap. In principle, the above notation is fine, but one has to
keep in mind that it is equal to $-|100\downarrow\rangle_1 |100\uparrow\rangle_2 |200\,s\rangle_3$, because this just interchanges
the states of electrons 1 and 2. To prevent any confusion, people like to write states explicitly
in an anti-symmetrised form. Here, this amounts to summing up all six permutations,
with the right signs, and each with an individual weight $1/\sqrt{6}$:

$$\begin{aligned}
|\psi\rangle_{\rm Li} = {} & \tfrac{1}{\sqrt{6}} |100\uparrow\rangle |100\downarrow\rangle |200\,s\rangle - \tfrac{1}{\sqrt{6}} |100\uparrow\rangle |200\,s\rangle |100\downarrow\rangle \\
& + \tfrac{1}{\sqrt{6}} |100\downarrow\rangle |200\,s\rangle |100\uparrow\rangle - \tfrac{1}{\sqrt{6}} |100\downarrow\rangle |100\uparrow\rangle |200\,s\rangle \\
& + \tfrac{1}{\sqrt{6}} |200\,s\rangle |100\uparrow\rangle |100\downarrow\rangle - \tfrac{1}{\sqrt{6}} |200\,s\rangle |100\downarrow\rangle |100\uparrow\rangle
\end{aligned}$$

In principle, this is just the same state as the one above, but it is explicitly anti-symmetrised,
and hence we can freely drop all subscripts denoting the particle numbers:
these are not relevant anymore in the above notation. In a similar fashion, we can construct
the states of the electrons in arbitrarily large atoms. This 'stacking' of electrons in energy
levels explains a very wide range of chemical properties of elements. As shown in Figure
11.3 it can describe the configuration of electrons around larger atoms. These kinds of
diagrams can in turn be used to explain which elements can be bound to which others,
and so on. So combining the exclusion principle with our earlier results opens up the vast
world of chemistry. That's quite pretty...
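The 'stacking' of electrons can be mimicked with a short sketch. It lists the single-electron states $(n, \ell, m, \text{spin})$ in order of increasing $n$ and then $\ell$, and hands out the first $N$ of them. This simple ordering is an assumption that works for light atoms such as lithium and magnesium; for heavier atoms the energy shifts discussed in the text change the filling order. The function names are illustrative.

```python
def single_electron_states():
    """Yield states (n, l, m, spin) ordered by n, then l, then m.
    Simplified ordering: adequate for light atoms only."""
    n = 1
    while True:
        for l in range(n):                 # l = 0, ..., n-1
            for m in range(-l, l + 1):     # m = -l, ..., l
                for spin in ("up", "down"):
                    yield (n, l, m, spin)
        n += 1

def ground_configuration(num_electrons):
    """Occupy the lowest states one electron at a time (Pauli principle)."""
    states = single_electron_states()
    return [next(states) for _ in range(num_electrons)]

# Lithium: the third electron is forced into n = 2. Its spin is arbitrary in
# reality; the code just picks "up" because that state is listed first.
lithium = ground_configuration(3)

# Magnesium: n = 1 and n = 2 are full, and two electrons sit in the 3s states.
magnesium = ground_configuration(12)
```

Because every state is handed out at most once, no two electrons ever share all four quantum numbers.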

Of course, there are some things one should bear in mind when considering larger 
atoms. There are several effects that shift the energy levels of the states. These make the 
energy of a state depend on all quantum numbers, not just n. So the most economic way 
of filling an atom's orbitals becomes a bit more subtle. You may remember some rules 
from chemistry that tell you in which order to fill the orbitals. These rules are precisely a 
consequence of the energy shifts we are talking about. Here we mention two main effects 
(in order of importance) determining the shifts and splittings of the energy levels: 

Effect 1: Coulomb repulsion

In larger atoms, the Coulomb force acts not only between the nucleus and the electrons, but
also between each pair of electrons. This is an extra energy contribution, and hence one
needs to put the corresponding term into the Hamiltonian. Physically, what is happening
is that the electrons in low energy states (small $n$ and small $\ell$) are covering up a part of
the charge of the nucleus. So electrons in more extended wavefunctions (larger $n$ or larger
$\ell$) see a lower effective nuclear charge. An approximate equation expressing this is

$$Z_{\rm eff} = Z - S \tag{11.35}$$

with $Z_{\rm eff}$ the nuclear charge seen by an electron that has $S$ electrons below itself. This
is called the shield effect. It is shown in Figure 11.4. Hence, the binding energies of
electrons in high orbits are lower than one would expect. Indeed, a lower (effective) nuclear
charge means that they are less attracted to the nucleus, and hence less well bound. Also,
states with the same $n$ but a different $\ell$ will not have the same energy any more. A
larger $\ell$ corresponds to a more extended wavefunction (recall from your chemistry class
the dumbbell-shaped p-orbital versus the spherical s-orbital) and will experience the
shield effect more strongly.

n = 1:
  s-orbital: $|1,0,0,\uparrow\rangle$  $|1,0,0,\downarrow\rangle$
n = 2:
  s-orbital: $|2,0,0,\uparrow\rangle$  $|2,0,0,\downarrow\rangle$
  p-orbital: $|2,1,-1,\uparrow\rangle$  $|2,1,-1,\downarrow\rangle$
             $|2,1,0,\uparrow\rangle$  $|2,1,0,\downarrow\rangle$
             $|2,1,1,\uparrow\rangle$  $|2,1,1,\downarrow\rangle$
n = 3:
  s-orbital: $|3,0,0,\uparrow\rangle$  $|3,0,0,\downarrow\rangle$

Figure 11.3: The atomic structure of magnesium. The twelve electrons sit in the lowest
possible energy states. On the left these are pictorially shown as Bohr orbits. The correct
description involving states is shown on the right. States with the same $n$ are called a
shell, and states with the same $n$ and $\ell$ comprise a subshell. So for magnesium two shells
($n = 1, 2$) are completely filled and one shell ($n = 3$) is partly filled.
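As a rough illustration of the shield effect, the sketch below applies $Z_{\rm eff} = Z - S$ to the outer electron of lithium and plugs the result into the standard hydrogen-like binding energy $13.6\,\text{eV} \cdot Z^2 / n^2$. That formula and the rounded value 13.6 eV are assumptions brought in for the example; they are not derived in this section.

```python
RYDBERG_EV = 13.6  # binding energy of the hydrogen ground state in eV (rounded)

def effective_charge(Z, S):
    """Effective nuclear charge Z_eff = Z - S seen by an electron that has
    S shielding electrons below it, as in (11.35)."""
    return Z - S

def hydrogen_like_binding_energy(Z_eff, n):
    """Binding energy in eV of a single electron in level n around charge Z_eff.
    Hydrogen-like estimate: it ignores all remaining electron-electron effects."""
    return RYDBERG_EV * Z_eff ** 2 / n ** 2

# Outer electron of lithium: Z = 3, two shielding 1s electrons, n = 2.
z_eff = effective_charge(3, 2)
shielded = hydrogen_like_binding_energy(z_eff, 2)   # fully shielded estimate
unshielded = hydrogen_like_binding_energy(3, 2)     # no shielding at all
```

The measured ionisation energy of lithium is about 5.4 eV. It lies between the two estimates and much closer to the shielded one, which shows both that shielding is the dominant effect and that (11.35) is only approximate.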



Nucleus, charge: $Ze$
'Lower orbit' electrons, charge: $S(-e)$
Effective nuclear charge: $(Z - S)e$

Figure 11.4: The shield effect, as experienced by the outer electron in lithium. The
electrons in the lowest energy states cover up the nuclear charge $+Ze$. So for the outer
electron, the effective nuclear charge is $(Z - S)e$, where $S$ is the number of shielding
electrons. Here $Z = 3$ and $S = 2$. Note that $S$ depends on the particular electron we are
considering.

Effect 2: Spin-orbit interactions

Recall that the angular momentum $\mathbf{L}$ of a state creates a magnetic moment. For example,
we found a Zeeman effect, even when not including spin. But there are also magnetic
moments due to the spin $\mathbf{S}$ of the electrons. Because any pair of magnetic moments tends
to align, there is an extra term in the Hamiltonian of the form $-\mathbf{L} \cdot \mathbf{S}$, which is called the
spin-orbit coupling.^1 Actually, even in the case of just one electron, this term is present.
The effect of this term is less drastic than the shield effect, but for precise results, one
should include it. The structure of atomic energy spectra obtained when including this
effect is called the fine structure. Typically, it involves small shifts of energy levels
(and splits the degeneracy with respect to the spin) which can be observed in precise
experiments.

Consequences 

There are several consequences of the above effects. From the physics side, we see that
the structure of electronic states is more involved than just antisymmetrising the
single-electron results. Most importantly, the energy levels turn out not to depend on $n$ only.
In the shield effect, the angular momentum plays a role, and upon including spin-orbit
coupling also the spin state becomes relevant. In the end, little is left of the degeneracy
that we might have expected. Actually this is not only an effect in larger atoms: spin-orbit
coupling also takes place in ordinary hydrogen, so even in that situation the fine
structure is non-trivial: there is a very small splitting of the energy states of equal $n$. We
stress again that the Coulomb interaction between the electrons is the largest effect. Since
this is absent in the single electron case, the result there is only subject to the spin-orbit
interaction. So only if one really does precise measurements are the spectral lines of hydrogen
seen to be split. But to a very good accuracy, the energy levels are given by the $n$-dependent
result we found in chapter 6. In hydrogen, the size of the splitting is tens of thousands of
times smaller than the energy differences between states with different $n$.

^1 The word 'coupling' is a very popular term to indicate that two quantities influence each other. In
the context of quantum mechanics, it means that there is a term in the Hamiltonian involving the two
quantities. This means that the energy depends on the relative values of those quantities, so they 'feel'
each other: they are not independent degrees of freedom.

Figure 11.5: Classical depiction of the spin-orbit coupling. Both the orbital angular
momentum $\mathbf{L}$ of the electron and the intrinsic angular momentum $\mathbf{S}$ lead to a magnetic
moment. These two magnetic moments tend to align. So a parallel state as shown here
has lower energy than when the two are anti-parallel. This means states with the same $\ell$
but different spin have different energies. With precise measurements these (very small)
energy differences can be measured.

From the physicist's side this means that much more work needs to be done to describe
many-electron atoms properly. In practice, one will have to rely on numerical techniques,
such as the Hartree-Fock method. But to apply those techniques, one really needs to
know what one is doing. So there is also quite some theoretical input. In conclusion, the
study of atoms is a rich field. If you want to know more about it, be sure to take a class
on atomic physics some time. With this course, you surely have enough background to
embark on it.

11.3.2 Application: Lasers 

As you know, electrons in high energy states tend to fall back after a while, emitting
energy in the form of a photon. For this reason, energy levels that are not the ground state are called
excited states. The energy of the emitted photon from a transition between states is
given by

$$\Delta E = E_i - E_f \tag{11.36}$$

where $E_i$ and $E_f$ stand for the energy levels of the initial and final state of the electron.
Such a process is called spontaneous emission. There is however a trick to make this
spontaneous decay process happen earlier. If you send a photon onto an atom that
is excited, and this photon has exactly energy $\Delta E$, the decay process will occur right
away. That is: two photons will instantly leave the atom, both of energy $\Delta E$. This
phenomenon is called stimulated emission. The process of stimulated emission was
predicted a long time ago, by Einstein. It was verified some years later,
and soon raised the interest of many scientists for practical applications. What if you
were able to put $N$ atoms all in the same excited state? If one of them decays, the
escaping photon would trigger all others to decay as well. This cascade process would
almost instantly release $N$ photons, all of the same energy $\Delta E$. The result would be: an
intense flash of light, consisting of photons that are all of identical energy (= identical
wavelength). Repeating the process, you can get a continuous beam of very 'pure' light.
Since the photons all have the same phase and frequency, such a source is called coherent
and monochromatic. This invention got the name LASER: Light Amplification by
Stimulated Emission of Radiation. Since the realisation of the first lasers, the laser has become
an important player in eye surgery, the metal cutting industry, measuring devices, printers,
CD and DVD readers, and so on. Why do we mention this here? Well, to stress that
this important invention would never work if photons were fermions. For each individual
flash, the photons exiting the laser are all in the same state: same energy, same position,
etcetera. This would be impossible under the asocial behaviour of fermions. Only bosons
like to sit together in the same state, fermions just can't. So the next time you watch
your favourite high definition action movie DVD, be sure to thank those photons in the
reading beam for being sociable bosons, otherwise we might still be working with those
charming but impractical black tapes of our parents' generation!

Figure 11.6: A possible design of a laser. A flash lamp (purple light) lifts the atoms below
it to an excited energy level. The decay may consist of several steps: typically first a
small step (no visible light) and then a larger one - for example emitting red light. By
stimulated emission, each photon rips out an entire cascade of identical photons from all the
excited atoms met on the way, giving a very monochromatic and coherent light beam.
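The statement 'identical energy = identical wavelength' can be made quantitative with the standard relation $E = hc/\lambda$ for a photon. The sketch below evaluates $\Delta E = E_i - E_f$ for a transition between two hydrogen levels and converts it to a wavelength. The hydrogen levels $E_n = -13.6\,\text{eV}/n^2$ (rounded) are used purely as a familiar example; a real laser medium has its own level scheme, and the function names are illustrative.

```python
H_PLANCK = 6.62607015e-34      # Planck constant in J s
C_LIGHT = 299792458.0          # speed of light in m/s
EV_IN_JOULE = 1.602176634e-19  # one electronvolt in joule

def hydrogen_level_ev(n):
    """Energy of hydrogen level n in eV, using the rounded value 13.6 eV."""
    return -13.6 / n ** 2

def photon_energy_ev(n_initial, n_final):
    """Delta E = E_i - E_f as in (11.36); positive for a downward transition."""
    return hydrogen_level_ev(n_initial) - hydrogen_level_ev(n_final)

def wavelength_nm(delta_e_ev):
    """Wavelength in nanometres of a photon with energy delta_e_ev (in eV)."""
    return H_PLANCK * C_LIGHT / (delta_e_ev * EV_IN_JOULE) * 1e9

delta_e = photon_energy_ev(3, 2)   # transition from n = 3 to n = 2
lam = wavelength_nm(delta_e)       # lands in the red part of the spectrum
```

Every photon released in this transition carries the same $\Delta E$ and therefore the same wavelength, which is the sense in which the emitted light is monochromatic.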
Exercises

1. Without looking, try to reconstruct as much as you can of the reasoning which leads 
to the distinction between bosons and fermions, and their defining property. 

2. In a many-particle system, the Hamiltonian will depend on the positions and momenta
of all the particles. Also, the Hilbert space will be bigger. However, the
Schrödinger equation still holds. Convince yourself that the postulates of quantum
mechanics still apply despite these generalizations.

3. In the text, the inner product of the product space was defined, based on the inner 
product of the single particle spaces. How would you have defined it if we had taken 
the direct sum of the spaces? 

4. Someone is arguing with you about whether quantum mechanics is different from classical
mechanics. He goes: 'Ok, particles are tiny pieces of wave. But there are waves in
classical physics too'. Explain to him that (as just one possible counter argument)
the quantum physics of many particles is not a classical theory of waves. (Hint: look
back at Figure 11.1.) In some sense, the conceptual shift to multi-particle quantum
mechanics departs much further from our intuition than the step we made to describe
a single particle - luckily at least the mathematical framework stays the same in this
step.

5. Symmetrize the state $|a\rangle|b\rangle|c\rangle|d\rangle$ (four bosons), respecting normalization. You may
assume that $|a\rangle$, $|b\rangle$, $|c\rangle$, $|d\rangle$ are all properly normalized.

6. Consider two fermions, both in a harmonic potential. The Hamiltonian is then

$$H = H_1 + H_2 = -\frac{\hbar^2}{2m} D_1^2 + \frac{m\omega^2}{2} X_1^2 - \frac{\hbar^2}{2m} D_2^2 + \frac{m\omega^2}{2} X_2^2$$

This Hamiltonian acts on states of the form $\psi(x_1, x_2)$. If we denote by $|n\rangle_1$ the n-th
harmonic oscillator state of the first particle (which is described by a wave function
with argument $x_1$) then $H_1 |n\rangle_1 = E_n |n\rangle_1$. Similarly, $H_2 |n\rangle_2 = E_n |n\rangle_2$ for the
second particle. Show that the state $|n\rangle_1 |m\rangle_2$ is an eigenstate of $H$. What is the
corresponding energy? What is the time evolution of this state? What is the
anti-symmetrised version? Which states are forbidden by Pauli's principle? The solution
to the combined system is so easy here because the Hamiltonian doesn't impose
any interaction between the particles: the potential only depends on the particles'
positions, not their relative distance.

7. Consider two particles, but now with Hamiltonian:

with $a_1$, $a_2$ and $K$ constants. What is the classical meaning of such a Hamiltonian?
Hint: interpret the three potential terms. For which molecule might this Hamiltonian
be a model? Here, the two particles are coupled: they feel each other as the energy
depends on their relative position, so the solutions aren't simply products of the
single-particle solutions anymore - so you don't have to solve the system here.

8. In the exercises of Chapter 5, you computed the spectrum of particles in a three
dimensional box. Here, we will look for the Fermi energy of such a system, when $N$
(noninteracting) fermions are in such a potential with volume $V$. To find the Fermi
energy, note that at $T = 0$, the $N$ lowest energy states are occupied. Each state is
characterized by a vector $\mathbf{n} = (n_x, n_y, n_z)$, and the energy of a state is essentially a
constant times the squared length of this vector. So roughly speaking, at zero temperature,
all states which are filled have their $\mathbf{n}$ inside a sphere with radius $R$. (At least, the
region where all components $n_x$, $n_y$, $n_z$ are positive - one eighth of a sphere that
is.) By computing the sphere volume, estimate the number of states lying inside
the sphere: $N(R)$. Invert this to find the radius corresponding to $N$ fermions at zero
temperature. The energy of a state on this sphere is then the Fermi energy of the
system. Prove that



Chapter 12 

Metals, insulators and all that's in 
between 

In this chapter. . . 

In this last chapter, we will move up one step in scale, and derive properties of metals
(but also insulators and semiconductors) from the microscopic quantum laws. The
understanding of these systems has an impact on our daily life which can hardly be
overestimated. An obvious example is the development of computers. These would still be
limited to constructions of rotating wheels or vacuum tubes had the transistor not been
invented at some point - a simple but versatile application of the physics of semiconducting
materials. Again, quantum mechanics is the gateway to truly understanding the underlying
physics.

12.1 Particle statistics

When describing large systems, one typically relies on statistical approaches. This is not
only a useful technique, it is often the only way out. For example, a container of gas can
in principle be described by the motion of all individual molecules, but this is so complex
that it is not feasible in practice. What you do then is describe how many of the molecules
would on average have this or that momentum, be at this or that position, etcetera. You
make statistical statements. This is the domain of Statistical Mechanics - on which you may
have had a course already. If you haven't, don't worry: we'll briefly review the concepts
we need in this context.

12.1.1 Population numbers 

A large collection of particles or molecules is typically described by a temperature. The
notion of temperature is intimately connected to statistical properties. For example, if a
(monoatomic, ideal) gas is at temperature $T$, the average energy per particle is given by

$$\langle E \rangle = \frac{3}{2} kT \tag{12.1}$$

with $k$ the Boltzmann constant. Not all individual particles have energy $\frac{3}{2} kT$ though:
on the contrary, many particles will have a much larger or much lower energy. Again, the
brackets (not to be confused with the quantum mechanical expectation value) denote that
the above is only an average over the entire collection of particles.

Luckily, the above can be refined a bit. Suppose that the states that can be taken
by the individual particles form a discrete set, with corresponding energies $E_i$. Then the
Boltzmann distribution states that the number $n_i$ of particles in energy level $E_i$ is
given by:

$$n_i = \frac{1}{Z} \, e^{-E_i / kT} \tag{12.2}$$

The numbers $n_i$ are also called occupation numbers. This law should be understood
as follows. If the energy $E_i$ of state $i$ is big, then the exponential will be very small,
so the state is only occupied by a few particles. States with lower $E_i$ have a larger
occupation number $n_i$. The effect of high temperature is to make states of large energy
more easily accessible, since the exponential suppression becomes less harsh. Again, the
above expression is a statistical one: it expresses how many particles will on average be
in $E_i$ if the system as a whole is at temperature $T$. In particular, $n_i$ does not have to
be an integer. The proportionality constant $Z$ has to be chosen such that $\sum_i n_i$ equals
the total number of particles. This constant can be ignored if one only considers ratios of
population numbers:

$$\frac{n_i}{n_j} = \frac{e^{-E_i / kT}}{e^{-E_j / kT}} \tag{12.3}$$

Figure 12.1: On the left a system at temperature $T$ with several energy levels and some
particles is shown. The specific arrangement is not determined by the temperature: the
only thing we can talk about is the average occupation number of each level of this system.
This is given by a function $n(E)$: the Boltzmann distribution. Here, we have inverse-plotted
the function to highlight the similarity to the figure on the left.
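A small numerical sketch shows how the Boltzmann distribution behaves. It computes the occupation numbers $n_i = e^{-E_i/kT}/Z$ for a handful of levels, with $Z$ fixed by the requirement that the $n_i$ add up to the total number of particles. The three energy levels, the particle number and the function name are made-up illustration values.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV per kelvin

def boltzmann_occupations(energies_ev, temperature_k, total_particles):
    """Average occupation numbers n_i = exp(-E_i / kT) / Z, where Z is chosen
    such that the n_i sum to total_particles."""
    kT = K_B_EV * temperature_k
    weights = [math.exp(-energy / kT) for energy in energies_ev]
    Z = sum(weights) / total_particles
    return [w / Z for w in weights]

levels = [0.0, 0.05, 0.10]                             # energies in eV (made up)
cold = boltzmann_occupations(levels, 300.0, 1000.0)    # room temperature
hot = boltzmann_occupations(levels, 600.0, 1000.0)     # twice as hot
```

At both temperatures the occupation drops with increasing energy, but at the higher temperature the upper levels hold a larger share of the particles. The ratio of any two occupation numbers does not depend on $Z$, in line with (12.3).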



Bosons and fermions 

From the previous chapter, we know that particles can not always be in one state together.
This is only possible for bosons. A fermionic state can either be occupied or empty,
nothing else. A more careful analysis for those cases shows that for bosons and fermions
the Boltzmann distribution has to be replaced by:

$$n_i = \frac{1}{e^{(E_i - \mu)/kT} \pm 1} \tag{12.4}$$



with $\mu$ a constant, explained below. The lower sign (minus) is to be used for bosons. Such
a distribution of particles over states is called Bose-Einstein statistics. The upper sign
(plus) is to be used for fermions. Such a distribution is called Fermi-Dirac statistics.
Clearly, in that case all $n_i$ are smaller than one, as should be for fermions.

The number $\mu$ in the above expression is again a normalization constant - although
it has now been absorbed into the exponential. It is to be taken such that the sum of
all occupation numbers equals the number of particles of the system. It is called the
chemical potential. It is interesting to see what happens at low temperatures. First,
consider bosons. Denote the lowest energy level by $E_0$. Of course, we need $\mu < E_0$, to
ensure that all occupation numbers are positive. If you now let $T \to 0$ and also $\mu \to E_0$,
then the number of particles in the lowest energy state ($n_0$) becomes large (the denominator
becomes $1 - 1$ since $\mu \to E_0$) while all others ($n_1, n_2, \ldots$) go to zero (the denominator
becomes $\infty - 1$ because $T \to 0$). This means all particles settle in the lowest energy state.
This phenomenon is called Bose-Einstein condensation. It
is only possible because bosons can all sit in the same state. Now, consider fermions. You
can check that in the limit $T \to 0$, the occupation numbers $n_i \to 1$ for all energy levels
with $E_i < \mu$ and that $n_i \to 0$ for all $E_i > \mu$. This is not so strange: at low temperatures,
the fermions seek the lowest energy configuration: stacking all lowest energy levels with
one particle each is the best you can do.

Low occupation limit 

You may ask yourself what the use of (12.2) is if we have (12.4). After all, every particle
(even an entire atom or molecule!) is either a boson or a fermion. Why don't we see that
a gas consists of bosons or fermions? From the above distributions you might infer that
the two cases behave drastically differently. The answer is that for most gases the average
occupation number of a state is extremely low. Not because there are few particles - on
the contrary - but because there is an even more overwhelming number of states. Hence, all $n_i$ will
be very small. This can only be due to the fact that $e^{(E_i - \mu)/kT}$ is very large, so for all these
systems the $\pm 1$ in the denominator becomes negligible, and (12.4) neatly reduces to (12.2).
Phrased differently: only if there are very few states available ($\mu$ high, i.e. comparable to
particle energies) or if the temperature is very low (small $e^{(E_i - \mu)/kT}$) does the difference between
bosons and fermions become visible. For typical gas systems neither is the case, so the
gas is well described by the ordinary Boltzmann distribution. So far the Stat-Mech recap,
time for the story...
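The low occupation limit can be checked numerically. The sketch below compares the three distributions as a function of the dimensionless quantity $x = (E - \mu)/kT$; the function names and the sample values of $x$ are illustrative choices, and the Boltzmann form is written with the normalization absorbed into $\mu$.

```python
import math


def boltzmann(x):
    """Boltzmann occupation e^{-x}, with x = (E - mu) / kT."""
    return math.exp(-x)


def fermi_dirac(x):
    """Fermi-Dirac occupation: upper (plus) sign in eq. (12.4)."""
    return 1.0 / (math.exp(x) + 1.0)


def bose_einstein(x):
    """Bose-Einstein occupation: lower (minus) sign in eq. (12.4); needs x > 0."""
    return 1.0 / (math.exp(x) - 1.0)


print(f"{'x':>5} {'Boltzmann':>12} {'Fermi-Dirac':>12} {'Bose-Einstein':>14}")
for x in (0.5, 2.0, 5.0, 10.0):
    print(f"{x:5.1f} {boltzmann(x):12.4e} {fermi_dirac(x):12.4e} {bose_einstein(x):14.4e}")
```

For small $x$ the three numbers differ substantially, while for large $x$ (low occupation) they become practically indistinguishable.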

12.2 Metals, insulators and all that's in between

In this course, we have met two types of systems: bound particles trapped in a potential,
and free particles which can move around freely - possibly meeting some obstacles. In the
first case, the spectrum typically consists of a discrete set of states that can be taken. In
sharp contrast, the spectrum of a free particle is continuous: by adjusting its momentum
one can in principle give it any energy. Here, we meet a third type of system: crystals.
By a 'crystal' we mean a solid in which the constituent atoms are arranged in a periodic
and ordered way (a lattice). Electrons in such an environment are surely bound: they
can not just walk out of the crystal. However, they are not strictly attached to one
single atom: within the grid, they can freely move around. In some sense this puts
crystals somewhere in between free and bound systems. And this is also reflected in the
spectrum. For a crystal, the energy states that can be occupied by the electrons consist
of several continuous regions, energy bands, which are separated from each other by
regions without accessible states: band gaps. An example is shown in Figure 12.2.
There, electrons can have energies in three bands. The bands are separated by two gaps.
There are no states of which the energy lies in these gap regions, so no electron can occupy
a state with that energy - they are 'forbidden' energy zones.

Figure 12.2: The typical allowed energy levels for a crystal. Several continuous energy
regions can be filled with electrons: these are the energy bands. In between them, there
are no states available: these are the gaps.



Level filling 

We just explained that every crystal structure has energy bands and band gaps associated
with it. The placement of these regions on the energy spectrum depends on several things:
the atoms making up the crystal (their atomic numbers), the spatial arrangement of the
atoms (the lattice type) and possibly also impurities. It is these data that decide where
the energy bands and band gaps lie. But the location of these energy regions is not the
end of the story. An important property is how far these energy levels are filled up by the
electrons present in the crystal. Once again, we note that the exclusion principle prevents
any two electrons from sitting in the same state. Hence, the best electrons can do
is fill up all the lowest energy states available, not putting any two in the same state.
Pictorially speaking, the electron filling defines a surface: the highest energy level that has
to be occupied, given that all levels below have already been filled. This surface is called
the Fermi surface, and the corresponding energy is the Fermi energy.¹ Now we have
all necessary ingredients to classify crystals into three large categories: conductors, insulators and
semiconductors. We review them in that order.

12.2.1 Conductors 

Conductors are materials that easily produce an electric current when an external potential 
is applied. What is the mechanism behind this current? As follows: the electrical field 
gives some electrons an extra energy, they are accelerated, and lifted up to a slightly 
higher energy level. These electrons of a slightly higher energy tend to have a momentum 
direction determined by the electric field. Hence they give rise to a macroscopic current. 
However, it is not always possible for an electron to take on a slightly higher energy 
level. To do so, there needs to be a vacant state available, right above the original state 
of the electron. Clearly, the electrons occupying low energy states can not do this: all
states above them are already taken. So these electrons do not take part in the process of
conduction. For electrons near the Fermi surface, the situation is different. If the Fermi
surface lies somewhere inside an energy band then the electrons there can jump up to
higher (unoccupied) energy states. In this scenario, the material can conduct a current
very well: it is a conductor. This case is shown on the left in Figure 12.3.

¹ Combining with the tools part, you may remark that at low temperatures these should tend to the
chemical potential.

12.2.2 Insulators 

We just explained that if the Fermi surface lies somewhere inside an energy band, the
electrons near that surface can jump up in energy and conduct a current. Another
possibility is that the Fermi surface lies at the top of an energy band. This
energy band is then completely filled, and is called the valence band. In such a case, the
lowest vacant energy states lie much higher: at the bottom of the next energy band, called
the conduction band. So electrons need to jump up over the entire band gap to reach
those states. If the band gap is wide, the applied potential is typically insufficient for a
significant number of electrons to cross the gap. Hence, only very few (or even none) of
the electrons will contribute to the current. Such a material does not conduct well, or even
not at all: it is an insulator.

Figure 12.3: The different types of crystals according to their electrical properties. Left:
a conductor has a broad band, in the middle of which lies the Fermi surface. This leaves
a lot of available states open right above the occupied states. Such a situation easily
conducts an electric current. Middle: when a large gap separates the (filled) valence band
from the (empty) conduction band, electrons can hardly overcome that gap. This means
few or no electrons will participate in the current: the material is an insulator.



12.2.3 Semiconductors 

The third category of materials is a bit of an in-between. The Fermi surface again lies at
the top of an energy band, but now the valence band and the conduction band are much
closer together. The energy gap to be crossed is then much smaller. Here, the role of
temperature becomes crucial. In the beginning of this chapter we met an expression
for the occupation of states for a collection of fermions at some temperature $T$, which can
be rewritten as follows:

$$ n(E) = \frac{1}{e^{(E - \mu)/kT} + 1} \tag{12.5} $$

The shape of the distribution at several temperatures is shown in Figure 12.4.

Figure 12.4: The Fermi-Dirac distribution for different temperatures. At zero temperature,
the system takes on the lowest possible energy state. This means all states below some
$\mu$ are filled and all higher states are empty. When raising the temperature, thermal
excitations start to show up: some electrons migrate to higher energies. The distribution
smoothly transitions to a more spread out distribution, the tail of which has an $e^{-E/kT}$
shape.

Back to the band gap of a semiconductor. Since the occupation numbers are low in the
conduction band, the statistics can be safely approximated by the Boltzmann distribution.
Indeed: $(E - \mu) \gg kT$ there, whence

$$ n(E) \approx e^{-(E - \mu)/kT} \tag{12.6} $$

Denoting the size of the energy gap by $\Delta E$, we find that the occupation number $n_{\text{cond}}$ of
the conduction band is proportional to:

$$ n_{\text{cond}} \propto e^{-\Delta E/kT} \tag{12.7} $$

since $(E - \mu) \sim \Delta E$ for low-lying energy levels in that band. This factor strongly depends
on the temperature, so we conclude that the conductivity of the semiconductor varies
strongly with temperature. Also, note that for each electron in the conduction band, a
hole is left in the valence band. In the background of the lattice, this hole looks like a
positive charge, and hence it can take part in the conduction too.
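To get a feeling for how strongly the conduction band occupation depends on temperature, here is a small sketch that evaluates the suppression factor $e^{-\Delta E/kT}$, following the approximation $(E - \mu) \sim \Delta E$ used in the text. The gap of 1.0 eV and the temperatures are hypothetical values picked for illustration, not data for a specific material.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann constant in eV/K


def conduction_factor(gap_ev, temperature):
    """Factor e^{-Delta E / kT} to which the conduction band occupation is proportional."""
    return math.exp(-gap_ev / (K_B_EV * temperature))


GAP_EV = 1.0  # hypothetical band gap, chosen only for illustration

reference = conduction_factor(GAP_EV, 300.0)
for temperature in (250.0, 300.0, 350.0):
    factor = conduction_factor(GAP_EV, temperature)
    print(f"T = {temperature:5.0f} K: factor = {factor:.3e}, "
          f"relative to 300 K = {factor / reference:.3e}")
```

Under these assumptions a temperature change of a few tens of kelvin already changes the factor by orders of magnitude, which is the strong temperature dependence described above.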

Besides the temperature, the conductivity of a semiconductor is strongly influenced
by impurities. That is because some impurities donate freely moving electrons (or holes).
Since only very few electrons of the semiconductor take part in the conduction, the
addition of a small amount of impurities may easily donate an equally large or much larger
number of available electrons. So adding impurities allows one to manipulate the properties
of semiconductors strongly. This is the underlying reason for the widespread use of
semiconductors in electronics. In particular, simple semiconductor-based applications like the
transistor have revolutionized our world, and this has even led some people to state it is
the single most important invention of the 20th century. Again, the physics underlying
this invention would not have been understood without the advent of quantum mechanics.
Time for an example...



12.3 A simple model



Strikingly enough, we can obtain a (semi-realistic) example of a system containing energy
bands and gaps without all too much effort, by the so-called Kronig-Penney model. As
we explained, a crystal is a solid material in which the atoms are arranged in a periodic
and ordered fashion. As a consequence, the Coulomb potential felt by electrons is periodic
as well. We now mimic such a periodic potential by $V(x)$ as shown in Figure 12.5. The
potential is zero over a distance $2a$, peaks at some $V(x) = V$ over a distance $b$, and this
is repeated. The period of the potential is given by $c = 2a + b$:

$$ V(x + c) = V(x) \tag{12.8} $$

We now introduce a translation operator $T$, which shifts any function horizontally over
a distance $c$:

$$ T f(x) = f(x + c) \tag{12.9} $$

For example $T V(x) = V(x + c) = V(x)$. Using this and the fact that $\frac{d}{dx} = \frac{d}{d(x + c)}$ (which
follows from the chain rule) we get

$$
\begin{aligned}
T H \psi(x) &= T \left( -\frac{\hbar^2}{2m} \frac{d^2}{dx^2} + V(x) \right) \psi(x) && (12.10) \\
&= \left( -\frac{\hbar^2}{2m} \frac{d^2}{d(x + c)^2} + V(x + c) \right) \psi(x + c) && (12.11) \\
&= \left( -\frac{\hbar^2}{2m} \frac{d^2}{dx^2} + V(x) \right) \psi(x + c) && (12.12) \\
&= H T \psi(x) && (12.13)
\end{aligned}
$$

Figure 12.5: A periodic potential. The value alternates between $V(x) = 0$ and $V(x) = V$.
The width of the peaks is $b$ and they are separated by a distance $2a$.



So $[H, T] = 0$. This means that if $|\psi\rangle$ is an eigenstate of $H$ with eigenvalue $E$, the state
$T|\psi\rangle$ will be so too, and with the same eigenvalue:

$$ H(T|\psi\rangle) = T H|\psi\rangle = T E|\psi\rangle = E(T|\psi\rangle). \tag{12.14} $$

If we assume that there is only one state for each energy, $|\psi\rangle$ and $T|\psi\rangle$ must be the same
physical state. So their wave functions $\psi(x)$ and $\psi(x + c)$ must be equal up to a phase
factor:

$$ \psi(x) = e^{i\alpha}\, T\psi(x) = e^{i\alpha}\, \psi(x + c) \tag{12.15} $$

The equation $\psi(x) = e^{i\alpha}\psi(x + c)$ is very restrictive. It can be shown that it can only be
satisfied for functions of the following form:

$$ \psi(x) = e^{iKx}\, u(x) \tag{12.16} $$

with $u(x)$ a periodic function with period $c$. Wave functions of this kind are called Bloch waves. They
are not necessarily periodic over a distance $c$, since the 'rotating' part $e^{iKx}$ can have any
periodicity (not necessarily $c$). Also note that the above equation implies that

$$ \psi(x + c) = e^{iKc}\, \psi(x). \tag{12.17} $$



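Now, let us put a particle of energy $E$ in the potential. For concreteness, take $E < V$; the
other case works out in a similar fashion. Defining

$$ \kappa^2 = \frac{2m}{\hbar^2}(V - E) \quad \text{and} \quad k^2 = \frac{2m}{\hbar^2} E \tag{12.18} $$

the solution in region 1 (see Figure 12.5) is given by:

$$
\begin{aligned}
\psi_1(x) &= A_1 e^{\kappa x} + B_1 e^{-\kappa x} && \text{for } -b - a < x < -a && (12.19) \\
&= A_2 e^{ikx} + B_2 e^{-ikx} && \text{for } -a < x < a && (12.20)
\end{aligned}
$$

From the above and (12.17) it follows that the solution in region 2 looks like:

$$
\begin{aligned}
\psi_2(x) &= e^{iKc} \left( A_1 e^{\kappa (x - c)} + B_1 e^{-\kappa (x - c)} \right) && \text{for } a < x < a + b && (12.21) \\
&= e^{iKc} \left( A_2 e^{ik(x - c)} + B_2 e^{-ik(x - c)} \right) && \text{for } a + b < x < a + c && (12.22)
\end{aligned}
$$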

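Demanding continuity of the wave function and its derivative at the points $x = -a$ and
$x = a$ gives the following four constraints:

$$
\begin{aligned}
A_1 e^{-\kappa a} + B_1 e^{\kappa a} &= A_2 e^{-ika} + B_2 e^{ika} && (12.23) \\
\kappa A_1 e^{-\kappa a} - \kappa B_1 e^{\kappa a} &= ik A_2 e^{-ika} - ik B_2 e^{ika} && (12.24) \\
A_2 e^{ika} + B_2 e^{-ika} &= e^{iKc} \left( A_1 e^{\kappa (a - c)} + B_1 e^{-\kappa (a - c)} \right) && (12.25) \\
ik \left( A_2 e^{ika} - B_2 e^{-ika} \right) &= \kappa e^{iKc} \left( A_1 e^{\kappa (a - c)} - B_1 e^{-\kappa (a - c)} \right) && (12.26)
\end{aligned}
$$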



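This is a homogeneous system of four equations in the unknowns $A_1$, $B_1$, $A_2$, $B_2$. The
condition to have (nontrivial) solutions is thus

$$
\begin{vmatrix}
e^{-\kappa a} & e^{\kappa a} & -e^{-ika} & -e^{ika} \\
\kappa e^{-\kappa a} & -\kappa e^{\kappa a} & -ik e^{-ika} & ik e^{ika} \\
-e^{iKc} e^{\kappa (a - c)} & -e^{iKc} e^{-\kappa (a - c)} & e^{ika} & e^{-ika} \\
-\kappa e^{iKc} e^{\kappa (a - c)} & \kappa e^{iKc} e^{-\kappa (a - c)} & ik e^{ika} & -ik e^{-ika}
\end{vmatrix} = 0 \tag{12.27}
$$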



To evaluate the determinant, we simplify it a bit. First divide away the factors $e^{-\kappa a}$,
$e^{\kappa a}$, $-e^{-ika}$ and $-e^{ika}$ from the four columns, and use $2a - c = -b$ to simplify some exponents.
The first row then looks like (1, 1, 1, 1). Then put (column 1 + column 2)/2 and (column
1 - column 2)/2 instead of the original columns 1 and 2, and similarly for the columns 3
and 4. Taking away a last factor $i$ from column 4, we get:

$$
\begin{vmatrix}
1 & 0 & 1 & 0 \\
0 & \kappa & 0 & k \\
-e^{iKc} \cosh(\kappa b) & e^{iKc} \sinh(\kappa b) & -\cos(2ka) & -\sin(2ka) \\
\kappa e^{iKc} \sinh(\kappa b) & -\kappa e^{iKc} \cosh(\kappa b) & k \sin(2ka) & -k \cos(2ka)
\end{vmatrix} = 0 \tag{12.28}
$$



A lot of zeros have popped up! By expanding this as a sum of cofactors of the elements of
the first row, the above is just a sum of two 3-by-3 determinants. These are not too hard: they
each lead to four terms. Collecting terms and simplifying expressions like $\sin^2\theta + \cos^2\theta = 1$
and $\cosh^2 x - \sinh^2 x = 1$, the whole thing becomes less and less ugly. The end result reads:

$$ \cos(2ka) \cosh(\kappa b) + \frac{\kappa^2 - k^2}{2 \kappa k} \sin(2ka) \sinh(\kappa b) = \cos(Kc) \tag{12.29} $$

So to recap: for the system to have smooth solutions at energy $E$, the above equation
needs to be satisfied. Since the right hand side lies between -1 and 1, the left hand side
must do so too in order to allow solutions. In conclusion, we need

$$ -1 \le f(E) \le 1, \tag{12.30} $$

with

$$ f(E) = \cos(2ka) \cosh(\kappa b) + \frac{\kappa^2 - k^2}{2 \kappa k} \sin(2ka) \sinh(\kappa b) \tag{12.31} $$

Figure 12.6: The function $f(E)$ for a specific value of $a$ and $b$. There are no states with
energy $E$ in the regions where $|f(E)| > 1$. So these zones are gaps, as indicated on
the figure. In the regions where $|f(E)| \le 1$ there are energy states available, so these
constitute the energy bands.
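The band condition can be explored numerically. The following sketch scans energies $0 < E < V$ and reports the intervals where $|f(E)| \le 1$. It assumes units in which $\hbar^2/2m = 1$ (so $k^2 = E$ and $\kappa^2 = V - E$), and the values of `A`, `B` and `V` are hypothetical parameters chosen only to produce a clear picture; a simple grid scan is used instead of a proper root finder.

```python
import math

# Units with hbar^2 / (2m) = 1, so that k^2 = E and kappa^2 = V - E.
A = 1.0   # half-width of the region with zero potential (hypothetical value)
B = 0.5   # width of the barrier (hypothetical value)
V = 20.0  # height of the barrier (hypothetical value)


def f(energy):
    """Left-hand side of the band condition, valid for 0 < energy < V."""
    k = math.sqrt(energy)
    kappa = math.sqrt(V - energy)
    return (math.cos(2 * k * A) * math.cosh(kappa * B)
            + (kappa ** 2 - k ** 2) / (2 * kappa * k)
            * math.sin(2 * k * A) * math.sinh(kappa * B))


def find_bands(n_points=20000):
    """Scan 0 < E < V on a grid and return (E_low, E_high) for each run with |f| <= 1."""
    bands = []
    start = None
    previous = None
    for i in range(1, n_points):
        energy = V * i / n_points
        allowed = abs(f(energy)) <= 1.0
        if allowed and start is None:
            start = energy            # a band begins here
        elif not allowed and start is not None:
            bands.append((start, previous))  # the band ended at the previous grid point
            start = None
        previous = energy
    if start is not None:
        bands.append((start, previous))
    return bands


for low, high in find_bands():
    print(f"band from E = {low:.3f} to E = {high:.3f}")
```

Changing `A`, `B` or `V` shifts and reshapes the bands, but the alternation of allowed and forbidden energy regions persists for a wide range of parameters.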



The function $f(E)$ is shown in Figure 12.6 for some arbitrary values of the parameters $a$, $b$
and $V$. It is immediately clear that there are genuine energy bands and energy gaps. This
behavior is rather generic: the parameters $a$, $b$ and $V$ need not be finely tuned
to arrive at pictures similar to Figure 12.6. So even for a very simplistic potential
we obtain a semi-realistic energy spectrum. One can upgrade this model by taking more
Coulomb-like potentials (of which the charge is partly shielded by the other electrons), by
making the model three-dimensional and by including imperfections of the crystal. So
with the very same principles, one can obtain even more realistic results.

Exercises

1. Explain why the electrons in crystals are somewhere in between bound and free.
How is this reflected in the energy spectrum?

2. It is sometimes stated that a conductor is a limiting case of a semiconductor, where
the valence and conduction band are right on top of each other, without gap.
Explain.

3. The surface of the sun is at about 5800 K. For every hydrogen atom in the 2s state,
how many atoms can you find in the 1s state? The Boltzmann constant is
$k = 8.62 \cdot 10^{-5}$ eV/K $= 1.38 \cdot 10^{-23}$ J/K.

4. Redo the periodic potential, but now for the case $E > V$. (There is a shortcut
to obtain this result: if $E > V$, then $\kappa$ becomes imaginary. Putting $\kappa = i k_2$ and
using $\sinh(ix) = i \sin x$ and $\cosh(ix) = \cos x$ you should be able to find the analog
of (12.29) in terms of $k$ and $k_2$. This should correspond to what you found in the
'brute' way.)

5. For the Kronig-Penney model, we found the energy spectrum. Recall that the states
are also characterized by their momentum k. (Just like for a free particle, the energy
is a function of the wave number.) So actually, the spectrum can also be depicted
as lines in an E-k diagram, instead of just bands on an energy axis. Look at Figure
12.7: the set of allowed states are lines in this diagram. The dots denote states
(actually, they form a continuous spectrum, but well) and a white dot denotes an
empty state. How do you see the energy gap in the pictures? The material shown
is an N-type semiconductor: a semiconductor doped by a material which donates
some extra electrons to the lattice. The occupation of states is shown for the same
material, in three different situations. Explain which physical situations you see in
the three different pictures. In the situation where a net current is flowing (due to
an external E-field), in which direction is it going?




Figure 12.7: An N-type semiconductor in different physical situations.



Outro 



That's it! This is the end of the course. You can now say that you know quite a bit about
quantum mechanics. When you think back, you may realize that quantum mechanics is not
just a funny and strange theory invented by even stranger physicists. It is an accurate and
comprehensible theory about nature on its smallest scale. Moreover, it is the beating heart
of reality as we see it. Indeed, atoms, molecules, crystalline solids, radiation: all these
crucial ingredients of nature are very strongly determined by the quantum mechanical
laws that govern their behavior. Also, the role of quantum mechanics in technological
applications is much bigger than most people think. Lasers, microprocessors, precision
thermometers and STM microscopes: none of these inventions would have been possible
without understanding the laws of quantum mechanics. Hopefully you have a broader
view on this now, but far more important: I hope you have enjoyed this textbook.

Bonus track: philosophical note



12.4 Quantum mystery and God's dice 

In the outro, emphasis was put on the practical value of quantum mechanics in quite a few
technological applications. This is a fact which seems to be a bit ignored by popular culture.
On the contrary, to a large public, quantum mechanics has a somewhat mysterious or even
transcendental status. This is despite the fact that the theory came to life at the very
beginning of the 20th century - the absolute prehistory on the scientific and technological
timescale. So where does this esoteric status of quantum mechanics come from? An important
element in this quantum mystery is the widespread idea that quantum mechanics is not a
deterministic theory. This contrasts with more or less every other theory in physics, and would
surely be an utterly mysterious conclusion of one of the most refined theories of nature.
Let us think about this more carefully. Indeterministic ... where did we see that word
popping up in this course? For sure there are no 'unknown' parameters in the Schrödinger
equation. Whether we are talking about one particle, or a bunch of them, or even an
entire lump of matter, the rules of quantum mechanics give a very clear and deterministic
prescription on how this system will evolve in time. If you know the system's Hamiltonian,
you can solve the time evolution, and say what it will look like at any later moment in
time. So in that respect, quantum mechanics is not even slightly different from -say- the
Newtonian description of point particles. It is only a bit more complex, and a lot more
accurate.

'Ah, but what about the measurement', you say. We did learn indeed that measuring 
the position of a particle for example yields a more-or-less unpredictable outcome. The 
answer will typically be some value lying inside the approximate location of the wave func- 
tion, but we can never predict what the exact outcome will be. Back then, we summarized 
this by the statement that a measuring device gives a 'reasonable, but random answer'. It 
is this aspect of the theory which people refer to when they say that quantum mechanics 
is indeterministic. And it is a puzzling feature indeed, which has caused a lot of confusion 
over the decades. A famous quote of Einstein in this context is: 'God does not play dice'.
This summarizes his and many others' dissatisfaction with the issue.

So when considering the issue of determinism, it seems like quantum mechanics shows
two faces. On one hand, every system (small or big) evolves perfectly deterministically by the
Schrödinger equation; on the other hand, the act of measurement introduces a random
outcome, and collapses the system onto the state that it was measured to be in. So
measuring devices seem to play a very special role here. Of course this is a bit paradoxical.
If the laws of quantum mechanics are correct (and there are very good reasons to believe
this) measuring devices also have to behave according to those laws. So they too need to
behave deterministically. What is going on here?

In principle, the issue is almost of mathematical clarity. Either measuring devices 
evolve according to the deterministic laws of quantum mechanics, or they have a special 
status that allows them to give fundamentally indeterministic answers. The problem is of 
course that studying the behavior of a measuring device on the basis of quantum mechanics 
is a horrible problem: it is practically impossible to do any realistic computation for such 
a large and complex system. There are some attempts to do so, but not with all too great 
success - and it will probably stay like that for a while. 

So the problem of deciding whether or not quantum mechanics is a deterministic theory
is essentially one of computational complexity. Actually, there is quite a lot to this phrase
of Einstein. What he refers to is of course that it seems unnatural for God² to include
'random' events in the evolution of any system - as if he were throwing dice for every
measurement. Funnily enough, when you think about dice, you may remark that these
(despite being our most universal symbol for randomness) do not behave randomly either.
They look like they do, but you know they just obey Newton's laws of mechanics. Let us
illustrate this with a discussion between Newton and a gambler.

Gambler: Hey Newton, do you have any fancy theory to predict the outcome 
of my dice? If you have, I'll learn physics right away! 

Newton: Well, actually, I have. They fall, bounce and twist just according to 
my laws of mechanics. 

Gambler: Oh, really? Can you predict what I will get right now? (Throws
dice.) Aha, a double six! Did you see that one coming?

Newton: Not really. In principle, if you would give me all data, the momentum
you gave the dice, their shape and that of the surface they land on, I could
predict the outcome. But even with all the right data, it's a very nasty
computation. Probably even harder than computing orbits of planets.

Gambler: (disappointed) So actually, you can not say anything interesting
about it.

Newton: Well, yes. There are probability laws. These will tell you that you
are much more likely to throw a total of seven than a total of twelve, which
you got just now. These laws are of great value, even though they are not able
to predict any individual outcome.

Gambler: Ah, I see. So I better take a course on statistics then?

Newton: If you want to be a better gambler, yes. Good luck with it!

Gambler: Thanks, Sir Newton, good luck with your laws!

² A term that he used as a metaphor for the most fundamental laws of nature.

In a very similar way, you may believe that measuring devices too behave deterministically.³
Only in the very end, the dazzlingly complex behavior gives rise to a relatively simple
probabilistic law: $P \sim |\psi|^2$. But how could such a simple law be the end result of such
a huge complexity? Well, after all a measuring device is a very special system, designed
to do a very special job. In the same way, a die is precisely designed to give a nice equal
chance distribution resulting from its dynamics - despite the fact that it was probably
not designed with the active purpose to 'develop a solid object, of which the dynamical
and elastic characteristics are such that the final stage after a throw on a smooth surface
ends up in an almost perfectly homogeneous probability distribution over the possible
outcomes'. In the same way, the simplicity of the probabilistic law of measurements in
quantum mechanics may be the result of 'accidental' smart design of the instrument.⁴

12.5 A jungle of interpretations 

So far a partly demystifying view on the whole subject. However, don't think there is
a consensus on this matter. There is a jungle of possible interpretations of the
physics of the measurement in quantum mechanics. Just check out 'interpretation of
quantum mechanics' on any search engine, and this jungle opens up. There are roughly
three categories of interpretations:

• Indeterministic theories. These take the process of measurement as an indication
that quantum mechanics is intrinsically non-deterministic. This leads to very deep
questions on what reality is, what an observer is and what connects them. Probably
the hardest and (sorry for the word) most confusing interpretation, these theories seem
to dominate the view on the matter presented to the general public. They are without doubt
the origin of the widespread mystery surrounding quantum mechanics.

• Indifferent theories. These do not even pose the question whether quantum
mechanics is deterministic or not, and categorize the issue as irrelevant and
philosophical. Here, the arguments (quite rightly) are that quantum mechanics is a perfectly
healthy theory. It is consistent, and allows one to describe systems and the (be it
probabilistic) outcomes of measurements. Where this probabilistic law comes from is not a
question the physicist should ask him- or herself. A quote supporting this: 'Shut
up and calculate.' If you are not fond of philosophy, this might be your favorite
viewpoint.

• Deterministic theories. These assume that the process of measurement is governed
by the same quantum laws that describe other systems. Some arguments for this
view were presented above. Conceptually, this is probably the least confusing option.
(Unless you are more dogmatic and like 'indifferent theories' better.) However, the
question of how this precisely works (i.e. how the probability rule emerges exactly)
is a very hard one, and remains largely unanswered.

As said, each of the categories comprises numerous theories and variations, and
probably equally many of them do not fall strictly into one of the three types outlined here.
Maybe you will develop a view of your own one day. Feel free to do so: the interpretation
of measurement in quantum mechanics is one of the (many) open problems of physics.

³ This similarity was very likely not intended by Einstein's phrase, but the possible analogy between
the underlying mechanisms is rather striking.
⁴ No intelligent design pun intended.